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Preface 


There are many discrete mathematics textbooks available, so why did I decide to invest my time 
and energy to work on something that perhaps only I myself would appreciate? 

Mathematical writings are full of jargon and conventions that, without proper guidance, are 
difficult for beginners to follow. In the past, students were expected to pick them up along the 
way on their own. Those who failed to do so would be left behind. Looking back, I consider 
myself lucky. It was by God’s grace that I survived all those years. Now, when I teach a 
mathematical concept, I discuss its motivation, explain why it is important, and provide a lot 
of examples. I dissect the proofs thoroughly to make sure everyone understands them. In brief, 
I want to show my students how to analyze mathematical problems. 

Most textbooks typically hide all these details. They only show you the final polished 
products. By training, mathematicians love short and elegant proofs. This is reflected in their 
own writing. Yes, the results are beautiful, but it is a mystery how mathematicians come up 
with such ideas. I want a textbook that discusses mathematical concepts in greater detail. I 
want to teach my students how to read and write mathematical arguments. Since I could not 
find a textbook that suited my needs, I started writing lecture notes to supplement the main 
text. Marginal notes, hands-on exercises, summaries, and section exercises were subsequently 
added at different stages. The lecture notes have evolved into a full-length text. 

Discrete mathematics is a rich subject, full of many interesting topics. Often, it is taught to 
both mathematics and computer science majors. Due to the limit in space, this text addresses 
mainly the needs of the mathematics majors. Consequently, we will concentrate on logic and 
proof techniques, and apply them to sets, basic number theory, and functions. In the last two 
chapters, we discuss relations and combinatorics, as many students will find them useful in other 
courses. 

Since the intended audience of the text is mathematics majors, I use a number of examples 
from calculus. By design, I hope this can help the students review what they have learned, and 
see that discrete mathematics forms the foundation of many mathematical arguments. 

Discrete mathematics is often a required course in computer science. I find it hard and 
unjust to serve two different groups of students in the same textbook. Although this text could 
be used in a typical first semester discrete mathematics class for the computer science majors, 
they need to consult another text for the second semester course. Here are two that serve this 
purpose well: 

• Alan Doerr and Kenneth Levasseur, Applied Discrete Structures. 

• Miguel A. Lerma, Notes on Discrete Mathematics. 

Both are available on-line. 

Why do I call this a workbook? There are many hands-on exercises designed to help students 
understand a new concept before they move on to the next. I believe the title Workbook reflects 
the nature of the book, because I expect the students to work on the hands-on exercises. But 
why spiral? Because the pedagogy is inspired by the spiral method. The idea is to revisit 
some themes and results several times throughout the course and each time further deepen 
your understanding. You will find some problems pop up more than once, and are solved in a 
different way each time. In other instances, a concept you learned earlier will be viewed from a 
new perspective, thus adding a new dimension to it. 
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Preface 


I am indebted to the anonymous reviewers, whose numerous valuable comments helped to 
shape the workbook in its current form. I would also like to express my great appreciation to 
Scott Richmond of Reed Library at the State University of New York at Fredonia, who provided 
many helpful suggestions and editorial assistance. 

The reason I developed this workbook is to help students learn discrete mathematics. If this 
workbook proves to be a failure, I am the one to blame. If you find this workbook serves its 
intended purposes, I give all the glory to God, in whom I believe and trust. 


Harris Kwong 
April 21, 2015 
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Chapter 1 


An Introduction 


1.1 An Overview 

What is discrete mathematics? Roughly speaking, it is the study of discrete objects. Here, 
discrete means “containing distinct or unconnected elements.” Examples include: 

• Determining whether a mathematical argument is logically correct. 

• Studying the relationship between finite sets. 

• Counting the number of ways to arrange objects in a certain pattern. 

• Analyzing processes that involve a finite number of steps. 

Here are a few reasons why we study discrete mathematics: 

• To develop our ability to understand and create mathematical arguments. 

• To provide the mathematical foundation for advanced mathematics and computer science 
courses. 

In this text, we will cover these five topics: 

1. Logic and Proof Techniques. Logic allows us to determine if a certain argument is 
valid. We will also learn several basic proof techniques. 

2. Sets. We study the fundamental properties of sets, and we will use the proof techniques 
we learned to prove important results in set theory. 

3. Basic Number Theory. Number theory is one of the oldest branches of mathematics; it 
studies properties of integers. Again, we will use the proof techniques we learned to prove 
some basic facts in number theory. 

4. Relations and Functions. Relations and functions describe the relationship between 
the elements from two sets. They play a key role in mathematics. 

5. Combinatorics . Combinatorics studies the arrangement of objects. For instance, one 
may ask, in how many ways can we form a five-letter word. It is used in many disciplines 
beyond mathematics. 

All of these topics are crucial in the development of your mathematical maturity. The importance 
of some of these concepts may not be apparent at the beginning. As time goes on, you will slowly 
understand why we cover such topics. In fact, you may not fully appreciate the subjects until 
you start taking advanced courses in mathematics. 

This is a very challenging course partly because of its intensity. We have to cover many 
topics that appear totally unrelated at first. This is also the first time many students have to 
study mathematics in depth. You will be asked to write up your mathematical argument clearly, 
precisely, and rigorously, which is a new experience for most of you. 
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Learning how to think mathematically is far more important than knowing how to do all 
the computations. Consequently, the principal objective of this course is to help you develop 
the analytic skills you need to learn mathematics. To achieve this goal, we will show you the 
motivation behind the ideas, explain the results, and dissect why some solution methods work 
while others do not. 


1.2 Suggestions to Students 

All mathematics courses are difficult. It takes hard work and patience to learn mathematics. 
Rote memorization does not work. Here are some suggestions that you may find helpful: 

1. Do not skip classes. 

2. Read the text, including the examples, before the lecture; review what you have learned 
after each lecture. 

3. Do the exercises. 

(a) First, study the examples in the book. 

(b) Make an effort to understand how and why a solution works, and remember how 
certain types of problems should be solved. 

(c) When you do a problem, ask yourself if you have seen something similar before; if 
you have, follow the steps in its solution. 

(d) After solving a problem, look for alternate solutions, analyze and compare their 
differences. 

4. Get help from the instructor, your friends, and whatever facility your college provides. 

5. Develop good study habits. 

(a) Keep working every day: study the book, your own lecture notes, and, most important 
of all, do the exercises at the end of each section. 

(b) Form a study group of two to three students, and meet on a regular basis to study 
together. 

(c) Check the solutions for any nonsense or discrepancies. 

(d) Learn how to solve the problems systematically. 

6. Perseverance. Do not give up easily. 

7. Be willing to help your classmates. Trying to explain something to others is the best way 
to learn anything new. 

Attitude is the real difference between success and failure. Nothing comes easy. To succeed, 
you have to work hard. But you also need to learn how to learn mathematics the right way. 

• Do not rely on memorizing formulas or procedures by rote. Instead, try to understand the 
concepts and ideas behind them. It is important to learn when and how to use them. 

• Of course, it does not mean that you need not memorize anything at all. On the contrary, 
many basic results and definitions need to be memorized. You may find it helpful to use 
a highlighter to mark the definitions and keywords that you have trouble recalling, and I 
urge you to review them frequently. 

• Do not compartmentalize the material; all sections are connected in one way or another. 
Consequently, as you move along from chapter to chapter and from section to section, try 
to observe the connections between the concepts you have learned. Without saying, it is 
understood that you need to remember what you had learned earlier. 
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• Write down all intermediate and partial results clearly. For instance, if the value of x is 7, 
do not just jot down the number 7; instead, write x = 7. Otherwise, you may forget what 
7 is after just a few minutes. In brief, present your results in such a way that they can be 
read and understood by everyone in the class. 

• While we are on the subject, let us comment briefly how to write up a solution. Take your 
homework assignments seriously. Keep in mind: to study for a test, you may want to 
review your homework, so you need to be able to read your own work. Write everything 
clearly and neatly. The process of writing out everything correctly helps you think about 
what you write. Very often, incoherent and incomprehensible writing is an indication of 
lack of understanding of the subject matter. 

• When doing your homework assignments, start with a draft, then look over it carefully, 
check the spelling and grammar, and revise the solution. Make sure you write in complete 
sentences and use correct notations. If necessary, you may have to polish it further. Before 
turning in the final version, be sure to check again for any mistakes that you may have 
overlooked. 

How should a student use this workbook? 

1. Read the workbook before class, and study the workbook again after each class. 

2. Read and study the examples in the workbook. 

3. Do the hands-on exercises. 

4. Do the section exercises. 

1.3 How to Read and Write Mathematics 

Reading mathematics is difficult for beginners. It takes patience and practice to learn how to 
read mathematics. You may need to read a sentence or a paragraph several times before you 
understand it completely. There are writing styles and notational conventions that you acquire 
only by reading and paying attention to how mathematics is written. As we proceed with the 
course, we will discuss the details. As a starter, let us offer several suggestions. 

• Make sure you know the definition of mathematical terms, the meaning and proper usage of 
mathematical symbols and notations. Although this may sound obvious, many beginners 
have difficulty understanding a mathematical argument because they fail to recall the 
exact meaning of certain mathematical concepts. 

• Often, the reason behind a claim lies in the sentence before it. Sometimes it could be 
found in the preceding paragraph, and it is not unusual that you may need to check 
several sentences or paragraphs before it. You need to take an active role in reading 
mathematics, and you need to remember what you have read. 

• Mathematicians prefer short and elegant proofs. To do this, they suppress the details of 
what they consider as “obvious” reasons. But what is obvious to one reader may not be 
that obvious to another. At any rate, for practical reasons, it is impossible to include every 
minute step in a mathematical argument. Consequently, keep your pencil and paper next 
to you, and be ready to check the calculation and fill in the missing details. 

• It may help to try out some examples just to see how an argument works. 

• After you finish reading a proof, go over it one more time, and try to summarize its key 
steps (in other words, try to draw an outline of the proof) in your own words. 

Writing mathematics is even harder! It takes much longer to learn how to write mathematics. 
Of course, the most important thing about a mathematical argument is its correctness. When 
we say “good” mathematical writing, we are talking about precision, clarity, and sound logic. 
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• Be precise! For example, do not just say “it” when it is unclear which quantity you are 
referring to. This is particularly true in a lengthy argument. In this regard, it helps to 
identify and hence distinguish different quantities by their names such as x, y, z, etc. 

• Use mathematical terms correctly! A common mistake is confusing an expression with an 
equation. An equation has an equal sign, as in 

x + y = 5, 

but an expression does not, as in 

x + y. 

• Likewise, the following is an inequality: 

x + y > 5. 

Do not call it an equation! 

• Do not abuse the word “solve.” For instance, many students would say “solve 5 2 + 7 3 .” 
A more appropriate saying should be “compute the value of 5 2 + 7 3 ,” or simply “evaluate 
5 2 + 7 3 .” 

In the beginning, it helps to follow what others do. This again means you need to read a lot of 
mathematical writing, and pick up styles that you are comfortable with. We often follow some 
conventions (unwritten rules, if you prefer) that everyone follows. 

Example 1.3.1 Consider this argument for showing that (x — y)(x + y) = x 2 — y 2 : 


We want to show that 

(x - y)(x + y) = x 2 - y 2 . 

After expanding the product on the left-hand side, 
we find 

= x 2 + xy — yx — y 2 = x 2 — y 2 , 
which is what we want to prove. 

The logic and mathematics in the argument are correct, but not the notation. In formal writing, 
each equation should be a stand-alone equation. The last equation is incomplete, because it does 
not have anything on the left-hand side of the equal sign. Here is a proper way to write the 
argument: 

We want to show that 

(x - y)(x + y) = x 2 - y 2 . 

After expanding the product on the left-hand side, 
we find 

(x - y)(x + y) = x 2 + xy - yx - y 2 = x 2 - y 2 , 
which is what we want to prove. 

The fix is simple: just repeat the left-hand side. A 
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Example 1.3.2 Short and simple mathematical expressions or equations such as a 2 + t> 2 = c 2 
can be written within a paragraph. Longer ones and expressions or equations that are important 
should be displayed separately, and centered, on their own lines, as in 

x 3 - y 3 = (x - y)(x 2 + xy + y 2 ). 

If we intend to refer to the equation later, assign a number to it, and enclose the number within 
parentheses: 

x 2 -y 2 = (x-y)(x + y). (1.1) 

Now, for example, we can say, because of (1.1), we find 

135 = 144 - 9 = 12 2 - 3 2 = (12 - 3)(12 + 3) = 9 • 15. 


For a longer equation such as 

(x + y) 2 = (x + y)(x + y) = x 2 + xy + xy + y 2 = x 2 + 2 xy + y 2 , 

it may look better and easier to follow if we break it up into several lines, and line them up 
along the equal signs: 


(• x + y) 2 = (x + y)(x + y) 

= x 2 + xy + xy + y 2 
= x 2 +2xy + y 2 . 

Although we display the equation in three lines, they together form one equation. The equal 
signs at the beginning of the second and third lines indicate that they are the continuation of 
the previous line. Since this is actually one long equation, we only need to say (a: + y) 2 once, 
namely, at the beginning. 

When part of the right-hand side extends beyond the margin, you may want to balance the 
look of the entire equation by repositioning the left-hand side: 

(a; 2 + 2xy + y 2 )(x 2 + 2 xy + y 2 ) 

= x 4 + 2x 3 y + x 2 y 2 + 2x 3 y + 4:X 2 y 2 + 2 xy 3 + x 2 y 2 + 2xy 3 + y 4 
= x 4 + Ax 3 y + &x 2 y 2 + Axy 3 + y 4 . 


In the multi- line display format, always write the equal signs at the beginning of the lines. Do 
not forget to align the equal signs. 

When part of the right-hand side is too long to display as a single piece, we may split it into 
multiple pieces: 


(x + yf 


(x + y) 2 {x + y) 3 

(x 2 + 2 xy + y 2 )(x 3 + 3 x 2 y + 3 xy 2 + y 3 ) 
x 5 + 2>x 4 y + 3 x 3 y 2 + x 2 y 3 + 2 x 4 y + 6 x 3 y 2 + 6 x 2 y 3 + 2 xy 4 
+ x 3 y 2 + 3 x 2 y 3 + 2>xy 4 + y 5 
x 5 + 5 x 4 y + 10a : 3 y 2 + 10a ,2 y 3 + 5 xy 4 + y 5 . 


It is a common practice to use indentation to indicate the continuation of part of a line into the 
next. A 


There will be more discussion as we continue. Let us not forget: the best way to learn is 
to watch and observe how others do it. Reading is a must! Reading and analyzing technical 
papers will surely improve your mathematical knowledge as well as your writing. 
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1.4 Proving Identities 

There are many methods that one can use to prove an identity. The simplest is to use algebraic 
manipulation, as we have demonstrated in the previous examples. In an algebraic proof, there 
are three acceptable approaches: 

• From left to right : expand or simplify the left-hand side until you obtain the right-hand 
side. 

• From right to left : expand or simplify the right-hand side until you obtain the left-hand 
side. 

• Meet in the middle: expand or simplify the left-hand side and the right-hand side separately 
until you obtain the same result from both sides. 

Example 1.4.1 To prove that 

x 3 - y 3 = (x - y)(x 2 +xy + y 2 ), 

we start from the right-hand side, because it is more complicated than the left-hand side. The 
proof proceeds as follows: 

(x — y)(x + xy + y ) = x— xy + xy — xy + xy — y 

= x 3 ~y 3 . 

Remember: start from one side and work on it until you obtain the other side. ▲ 

Example 1.4.2 The following “proof” of 

4 i 2 2 i 4 /2i i 2\ / 2 , 2\ 

x + x y + y = (x + xy + y )(x — xy + y ) 

is incorrect: 

A wrong proof. 


Here 


at the start of the proof, by convention, we are proclaiming that x 4 + x 2 y 2 + y 4 is indeed equal 
to (x 2 + xy + y 2 ){x 2 — xy + y 2 ). However, this is what we are asked to prove. Before we have 
actually proved that it is true, we do not know yet, whether they are equal. Therefore, it is 
wrong to start the proof with it. ▲ 

Example 1.4.3 For the same reason, the following “proof” of the identity 

x 3 -y 3 = (x- y)(x 2 +xy + y 2 ) 

is unacceptable: 

Another wrong proof. 
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By putting x 3 — y 3 on the left-hand side of every line, this becomes (by convention) a collection 
of three equations. In a nutshell, the argument starts with an equation and we simplify until 
we obtain something we know is true. If this format is valid, we can “prove” that 21 = 6, as 
follows: 


21 = 

6 

6 = 

21 

27 = 

27 


By writing 21 = 6 at the beginning of the proof, what we really say is “ Assume 21 = 6 is true.” 
But this is what we intend to prove. Thus, in effect, we are putting the cart in front of the 
horse, which is logically incorrect. There is another explanation why this proof is incorrect. We 
shall discuss it in Section 2.3. A 


In brief: we cannot start with the given identity and simplify both sides until we obtain an 
equality (or an equation of the form 0 = 0). 

Example 1.4.4 Show that | k(k + 1)(2 k + 1) + [k + l) 2 = g(fc + l)(fc + 2)(2 k + 3). 

Solution 1: We can use the “meet in the middle” approach. Recall that we cannot simplify 
both sides simultaneously. Instead, we should expand the two sides separately , and then compare 
the results. We also suggest adding more writing (in words) to help with the explanation. 

After expansion, the left-hand side becomes 

g k(k + l)(2k + 1) + (fc + l) 2 = g(2fc 3 + 3fc 2 + k) + (k 2 + 2k + 1) 

= | k 3 + | k 2 + -g^ k + 1. 

The right-hand side expands into 

g(fc + l)(/c + 2)(2fc + 3) = g(2fc 3 + 9fc 2 + 13fc + 6) 

= |fc 3 + |fc 2 + f k+1. 

Since both sides yield the same result, they must be equal. 

Although the proof is correct, it requires two sets of computation. It is much easier to use either 
the left-to-right or the right-to-left approach. 


Solution 2: A better alternative is to start from the left-hand side and simplify it until we 
obtain the right-hand side. Our secret weapon is factorization: 


g k{k + l)(2k + 1) + (fc + l) 2 


g(fc + l)[k(2k + 1) + 6(fc + 1)] 
g(fc + l)(2fc 2 + 7fc + 6) 
g (k + l)(fc + 2)(2fc + 3). 


This approach is usually better and safer, because no messy computation is involved. A 

Hands-On Exercise 1.4.1 Show that 

k(k + l)(fc + 2) _j_ + 2) = (k + l)(fc + 2)(fc + 3) 


3 


3 
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Be sure to use one of the three methods we discussed above. 


A 


Summary and Review 

• There are only three ways to prove an identity: left to right, right to left, or meet in the 
middle. 

• Never prove an identity by simplifying both sides simultaneously. 

Exercises 1.4 

1. Let x and y be any real numbers. Prove that 

(x + y) 3 = x 3 + 3 x 2 y + 3 xy 2 + y 3 . 

2. Let x and y be any real numbers. Prove that 

(a - b) A = a 4 - 4a 4 6 + 6a 2 6 2 - lab 3 + b 4 . 

3. Prove that, for any distinct real numbers x and y , 

x 3 ~y 3 2 , .2 

= x + xy + y . 

x-y 


4. Prove that, for any integer k. 

fc(fc + l)(fc + 2)(fc + 3) + (fc + 1)(jfe + 2)(fc + 3) = (fc + l)(fc + 2)(fc + 3)(/c + 4) 

5. Prove that, for any integer k, 


k 2 (k + l) 2 


(k + l) 2 (fc + 2) 2 


4 


+ (fc + l ) 3 


4 
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Logic 


2.1 Propositions 

The rules of logic allow us to distinguish between valid and invalid arguments. Besides mathe- 
matics, logic has numerous applications in computer science, including the design of computer 
circuits and the construction of computer programs. To analyze whether a certain argument is 
valid, we first extract its syntax. 

Example 2.1.1 These two arguments: 

• If x + 1 = 5, then x = 4. Therefore, if x ^ 4, then x + 1 ^ 5. 

• If I watch Monday night football, then I will miss the following Tuesday 8 A.M. class. 
Therefore, if I do not miss my Tuesday 8 A.M. class, then I did not watch football the 
previous Monday night. 

use the same format: 

If p then q. Therefore, if q is false then p is false. 

If we can establish the validity of this type of argument, then we have proved at once that both 
arguments are legitimate. In fact, we have also proved that any argument using the same format 
is also credible. A 

Hands-On Exercise 2.1.1 Can you give another argument that uses the same format in the 
last example? 


A 

In mathematics, we are interested in statements that can be proved or disproved. We define 
a proposition (sometimes called a statement , or an assertion) to be a sentence that is either 
true or false, but not both. 

Example 2.1.2 The following sentences: 

• Barack Obama is the president of the United States. 

• 2 + 3 = 6. 


A 


are propositions, because each of them is either true or false (but not both). 
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Example 2.1.3 These two sentences: 

• Ouch! 

• What time is it? 

are not propositions because they do not proclaim anything; they are exclamation and question, 
respectively. A 


Example 2.1.4 Explain why the following sentences are not propositions: 

(a) x + 1 = 2. 

(b) x - y = y - x. 

(c) A 2 = 0 implies A = 0. 

Solution: (a) This equation is not a statement because we cannot tell whether it is true or false 
unless we know the value of x. It is true when x = 1; it is false for other a:- values. Since the 
sentence is sometimes true and sometimes false, it cannot be a statement. 

(b) For the same reason, since x — y = y ~ x is sometimes true and sometimes false, it cannot 
be a statement. 

(c) This looks like a statement because it appears to be true all the time. Yet, this is not a 

statement, because we never say what A represents. The claim is true if A is a real number, but 
it is not always true if A is a matrix 1 . Thus, it is not a proposition. A 

Hands-On Exercise 2.1.2 Explain why these sentences are not propositions: 

(a) He is the quarterback of our football team. 

(b) x + y = 17. 

(c) AB = BA. 


A 

Example 2.1.5 Although the sentence “x + 1 = 2” is not a statement, we can change it into 
a statement by adding some condition on x. For instance, the following is a true statement: 

For some real number x, we have x + 1 = 2. 

and the statement 

For all real numbers x, we have x + 1 = 2. 

is false. The parts of these two statements that say “for some real number x" and “for all real 
numbers x” are called quantifiers. We shall study them in Section 2.6. A 

1 Some students may not be familiar with matrices. A matrix is rectangular array of numbers. Matrices are 
important tools in mathematics. The product of two matrices of appropriate sizes is defined in a rather unusual 
way. It is the peculiar way that two matrices are multiplied that makes matrices so useful in mathematics. The 
square of a matrix is of course the product of the matrix with itself. It is well-defined only when the matrix is a 
square matrix. As it turns out, the order of multiplication of two matrices is important. In other words, given 
any two matrices A and B, it is not always true that AB = BA. 
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Example 2.1.6 Saying that 

“A statement is not a proposition if we cannot decide whether it is true or false.” 
is different from saying that 

“A statement is not a proposition if we do not know 
how to verify whether it is true or false.” 

The more important issue is whether the truth value of the statement can be determined in 
theory. Consider the sentence 

Every even integer greater than 2 can be written as the sum of two primes. 

Nobody has ever proved or disproved this claim, so we do not know whether it is true or false, 
even though computational data suggest it is true. Nevertheless, it is a proposition because it 
is either true or false but not both. It is impossible for this sentence to be true sometimes, and 
false at other times. With the advancement of mathematics, someone may be able to either 
prove or disprove it in the future. The example above is the famous Goldbach Conjecture , 
which dates back to 1742. A 

We usually use the lowercase letters p, q and r to represent propositions. This can be 
compared to using variables x, y and z to denote real numbers. Since the truth values of p , 
q , and r vary, they are called propositional variables. A proposition has only two possible 
values: it is either true or false. We often abbreviate these values as T and F, respectively. 

Given a proposition p, we form another proposition by changing its truth value. The result 
is called the negation of p, and is denoted ->p or ~ p, both of which are pronounced as “not 
p.” The similarity between the notations ~<p and —x is obvious. 

We can also write the negation of p as p , which is pronounced as “p bar.” The truth value 
of p is opposite of that of p. Hence, if p is true, then p would be false; and if p is false, then p 
would be true. We summarize these results in a truth table: 


p 

P 

T 

F 

F 

T 


Example 2.1.7 Find the negation of the following statements: 

(a) George W. Bush is the president of the United States. 

(b) It is not true that New York is the largest state in the United States. 

(c) a: is a real number such that x = 4. 

(d) a: is a real number such that x < 4. 

If necessary, you may rephrase the negated statements, and change a mathematical notation to 
a more appropriate one. 

Solution: (a) George W. Bush is not the president of the United States. 

(b) It is true that New York is the largest state in the United States. 

(c) The phrase “x is a real number” describes what kinds of numbers we are considering. The 
main part of the proposition is the proclamation that x = 4. Hence, we only need to negate 
“x = 4” . The answer is: 

a; is a real number such that x ^ 4. 

(d) a: is a real number such that x > 4. A 

Hands-On Exercise 2.1.3 Negate the following statements: 

(a) a: is an integer greater than 7. 
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The meaning of kS. 


(b) We can factor 144 into a product of prime numbers. 


(c) The number 64 is a perfect square. 


A 


Since we will be studying numbers throughout this course, it is convenient to introduce some 
notations to facilitate our discussion. Let 

N = the set of natural numbers (positive integers), 

Z = the set of integers, 

ffi. = the set of real numbers, and 

Q = the set of rational numbers. 

Recall that a rational number is a number that can be expressed as a ratio of two integers. 
Hence, a rational number can be written as — for some integers m and n, where n ^ 0. If you 
use a word processor, and cannot find, for example, the symbol N, you may use bold face N as 
a replacement. 

We usually use uppercase letters such as A , B , C, S and T to represent sets, and denote 
their elements by the corresponding lowercase letters a, b, c, s, and t, respectively. To indicate 
that b is an element of the set B } we adopt the notation 

b € B [pronounced as “6 belongs to B"]. 

Occasionally, we also use the notation 

B 9 b [pronounced as “ B contains &”]. 

Consequently, saying x € R. is another way of saying £ is a real number. 

Denote the set of positive real numbers, the set of negative real numbers, and the set of 
nonzero real numbers, by inserting the appropriate sign in the superscript: 

R + = the set of all positive real numbers, 

R _ = the set of all negative real numbers, 

R* = the set of all nonzero real numbers. 

The same convention applies to Z and Q. Notice that Z + is same as N. 

In addition, if S' is a set of numbers, and k is a number, we sometimes use the notation kS 
to indicate the set of numbers obtained by multiplying k to every number in S. 

Example 2.1.8 The notation 2Z denotes the set of all even integers. Take note that an even 
integer can be positive, negative, or even zero. A 

Summary and Review 

• A proposition (statement or assertion) is a sentence which is either always true or always 
false. 

• The negation of the statement p is denoted ->p, ~p, or p. 

• We can describe the effect of a logical operation by displaying a truth table which covers 
all possibilities (in terms of truth values) involved in the operation. 
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• The notations R, Q, Z, and N represent the set of real numbers, rational numbers, integers, 
and natural numbers (positive integers), respectively. 

• If S denotes a set of numbers, S + means the set of positive numbers in S, S~ means the 
set of negative numbers in S, and S* means the set of nonzero numbers in S. 

• If S denotes a set of numbers, and k is a real number, then kS means the set of numbers 
obtained by multiplying k to every number in S. 


Exercises 2.1 


1. Indicate which of the following are propositions (assume that x and y are real numbers). 

(a) The integer 36 is even. 

(b) Is the integer 3 15 — 8 even? 

(c) The product of 3 and 4 is 11. 

(d) The sum of x and y is 12. 

(e) If x > 2, then x 2 > 3. 

(f) 5 2 — 5 + 3. 

2. Which of the following are propositions (assume that £ is a real number)? 

(a) 2-7T + 57 t = 77t. 

(b) The product of x 2 and x 3 is x 6 . 

(c) It is not possible for 3 15 — 7 to be both even and odd. 

(d) If the integer x is odd, is x 2 odd? 

(e) The integer 2 524287 — 1 is prime. 

(f) 1.7 + .2 = 4.0. 

3. Determine the truth values of these statements: 

(a) The product of x 2 and x 3 is x 6 for any real number x. 

(b) x 2 > 0 for any real number x. 

(c) The number 3 15 — 8 is even. 

(d) The sum of two odd integers is even. 

4. Determine the truth values of these statements: 

(a) 7T e z. 

(b) l 3 + 2 3 + 3 3 = 3 2 • 4 2 /4. 

(c) u is a vowel. 

(d) This statement is both true and false. 

5. Negate the statements in Problem 4. 

6. Determine the truth values of these statements: 


(a) y/2 G Z (b) -1 i Z+ 

(d) 7T G R (e) | G Q 

7. Determine whether these statements are true or false: 


(c) 0 G N 
(f) 1.5 G Q 


(a) 0 G Q (b) 0 G Z (c) -4 G Z 

(d) -4 G N (e) 2 G 3Z (f) -18 G 3Z 

8. Negate the following statements about the real number x: 


(a) x > 0 


(b) x < — 5 


(c) 7 < x 


9. Explain why 7Q = Q. Is it still true that 0Q = Q? 

10. Find the number(s) k such that fcZ = Z. 
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2.2 Conjunctions and Disjunctions 

Given two real numbers x and y, we can form a new number by means of addition, subtraction, 
multiplication, or division, denoted x + y, x — y, x ■ y. and x/y, respectively. The symbols +, — , 
• , and / are binary operators because they all work on two operands . In fact, the negative 
sign in —x can be regarded as a unary operator that changes the sign of x. 

In a similar manner, from one or more logical statements, we can form a compound state- 
ment by joining them with logical operators , which are also called logical connectives 
because they are used to connect logical statements. Obviously, negation is a unary operation. 

Since a compound statement is itself a statement, it is either true or false. Therefore, we 
define a logical operation by describing the truth value of the resulting compound statement. 
The first two binary operations we shall study are conjunction and disjunction. They perform 
the “and” and “or” operations, respectively. 


name 

meaning 

notation 

truth value 

conjunction 

p and q 

pAq 

true if both p and q are true, false otherwise 

disjunction 

p or q 

PV q 

false if both p and q are false, true otherwise 


Their truth values are summarized in the following truth table: 


P 

q 

pAq 

PV q 

T 

T 

T 

T 

T 

F 

F 

T 

F 

T 

F 

T 

F 

F 

F 

F 


Example 2.2.1 Do not use mathematical notations as abbreviation in writing. For example, 
do not write “x A y are real numbers” if you want to say “x and y are real numbers.” 

In fact, the phrase “x A y are real numbers” is syntactically incorrect. Since A is a binary 
logical operator, it is used to connect two logical statements. Here, the “x” before A is not a 
logical statement. Therefore we cannot write “x A y are real numbers.” 

Incidentally, the statement u x and y are real numbers” is actually a conjunction. It means 
“x is a real number and y is a real number,” or symbolically, 

(x £ R) A (y € R). 

It is wrong to write “x A y € R.” Can you explain why? A 

Hands-On Exercise 2.2.1 Write “x and y are rational” as a conjunction, first in words, then 
in mathematical symbols. 


A 

Example 2.2.2 The statement “New York is the largest state in the United States and New 
York City is the state capital of New York” is clearly a conjunction. A conjunction of two 
statements is true only when both statements are true. Since New York is not the largest state 
in the United States, the conjunction is false. 

In general, in a conjunction of two statements, if the first statement is false, no further 
consideration of the second statement is necessary since we know the conjunction must be false. 
In computer science, this is referred to as the short circuit evaluation. A 

Example 2.2.3 The statement “\/30 is greater than 6 or -\/30 is less than 5” can be expressed 
symbolically as 

(-v/30 > 6) V (a/30 < 5). 

Both statements “a/ 30 > 6” and “\/30 < 5” are false. Hence, their disjunction is also false. A 
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Example 2.2.4 Determine the truth values of the following statements: 

(a) (-s/30 > 5) A (v/30 > 7) 

(b) Either (s/30 < 5) or (/30 > 7) 

Solution: (a) Since s/30 > 5 is true, but s/30 > 7 is false, their conjunction is false. 

(b) Since \/30 < 5 is false, and s/30 > 7 is also false, their disjunction is false. A 

Hands-On Exercise 2.2.2 Determine the truth values of the following statements: 

(a) (s/M < 5) and (s/30 > 7). 

(b) (s/30 > 5) V (s/30 < 7). 


Be sure to show your reasons. A 

Example 2.2.5 What does “0 < x < 1” really mean, logically? 

Solution: It means the conjunction “(0 < a:) A (x < 1).” Hence, given a real number x, to test 
whether 0 < x < 1, we have to check whether 0 < x and x < 1. A 

Hands-On Exercise 2.2.3 Write 5 < x < 8 as a conjunction. 


A 

Hands-On Exercise 2.2.4 Many students assume that they can negate “0 < x < 1” by 
reversing the signs. However, neither “0 > x > 1” nor “0 > x > 1” is the correct negation. 
For example, what does “0 > x > 1” really mean? Actually, the statement “0 > x > 1” is 
syntactically correct, and it is always false. Can you explain why? 


A 

In the everyday usage of most languages, when we say “p or q,” we normally mean exclusive 
or, which means either p or q is true, but not both. An example is “I either pass or fail this 
course,” which really means 

Either I pass this course or I fail this course. 

Sometimes, as illustrated in the statement 

Either you pass this course, or I pass this course. 

the connective “or” can be interpreted as an inclusive or. The actual meaning of “or” in human 
languages depends on the context. In mathematics, however, “or” always means inclusive or. 


“0 < x < 1” means 
“0 < x and x < 1.” 


The negation of 
“0 < x < 1” is not 
“0 > x > 1.” 
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Summary and Review 

• The conjunction “p and q" is denoted “p A q” . It is true only when both p and q are true. 

• The disjunction u p or q” is denoted “p V q ” . It is false only when both p and q are false. 

• The inequality “a < x < b" is actually a conjunction, it means “(a < x) A (x < b)” . 

• Likewise, the phrase “x and y are rational” is also a conjunction, it means “x is rational 
and y is rational.” Symbolically, we can write ”x G Q A y G Q.” 

Exercises 2.2 

1. Let p, q, and r represent the following statements: 

p: Sam had pizza last night. 

q: Chris finished her homework. 

r: Pat watched the news this morning. 

Give a formula (using appropriate symbols) for each of these statements: 

(a) Sam had pizza last night and Chris finished her homework. 

(b) Chris did not finish her homework and Pat watched the news this morning. 

(c) Sam did not have pizza last night or Chris did not finish her homework. 

(d) Either Chris finished her homework or Pat watched the news this morning, but not 
both. 

2. Define the propositional variables p, q, and r as in Problem 1. Express, in words, the 
statements represented by the following formulas: 

(a) pVg 

(b) q A r 

(c) (p A q) V r 

(d) pVr 

3. Consider the following statements: 

p: Niagara Falls is in New York. 

q: New York City is the state capital of New York. 

r: New York City will have more than 40 inches of snow in 2525. 

The statement p is true, but the statement q is false. Represent each of the following 
statements by a formula. What are their truth values if r is true? What if r is false? 

(a) Niagara Falls is in New York and New York City is the state capital of New York. 

(b) Niagara Falls is in New York or New York City is the state capital of New York. 

(c) Either Niagara Falls is in New York and New York City is the state capital of New 
York, or New York City will have more than 40 inches of snow in 2525. 

(d) New York City is not the state capital of New York and New York City will have 
more than 40 inches of snow in 2525. 

4. Determine the truth values of these statements: 

(a) (0 € Q) A (-4 G Z) (b) (-4 G N) V (3 G 2Z) 

5. Determine the truth values of these statements: 

(a) (-3 > -2) A (x/3 > 2) (b) (4 2 - 5 2 < 0) V (a/3 2 + 4 2 = 3 + 4) 

6. Construct the truth tables for the following formulas: 

(a)pAg (b )pVq 


(c) p A q 
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7. Rewrite the following expressions as conjunction: 

(a) 4 < x < 7 (b) 4 < x < 7 (c) 4 < x < 7 

8. In words, the inequality 0 < x < 1 means “ x is between 0 and 1.” Its negation means x 

is outside this range. Hence, the negation is a x < 0 or x > 1.” Find the negation of the 
following inequalities: 

(a) 0 < x < 4 (b) — 2 < x < 5 (c) 1.76 < x < VE 

9. In volleyball it is important to know which team is serving, because a team scores a point 

only if that team is serving and wins a volley. If the serving team loses the volley, then 
the other team gets to serve. Thus, to keep score in a volleyball game between teams A 
and B , it may be useful to define propositional variables p and q , where p is true if team 

A is serving (hence false if team B is serving); and q is true if team A wins the current 

volley (hence false if team B wins it). 

(a) Give a formula that is true if team A scores a point and is false otherwise. 

(b) Give a formula that is true if team B scores a point and is false otherwise. 

(c) Give a formula that is true if the serving team loses the current volley and is false 
otherwise. 

(d) Give a formula whose truth value determines whether the serving team will serve 
again. 

10. The exclusive or operation, denoted p Y. q, means “p or q, but not both.” 

(a) Express p Y. q as a logic statement. 

(b) Construct the truth table for p Y q. 


2.3 Implications 

Most theorems in mathematics appear in the form of compound statements called conditional 
and biconditional statements. We shall study biconditional statement in the next section. Con- 
ditional statements are also called implications. 


Definition. An implication is the compound statement of the form “if p, then q .” It is 
denoted p q, which is read as “p implies q .” It is false only when p is true and q is false, and 
is true in all other situations. 


V 

q 

p=> q 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 


The statement p in an implication p => q is called its hypothesis , premise , or antecedent , 
and q the conclusion or consequence. <£> 


Implications come in many disguised forms. There are several alternatives for saying p => q. 
The most common ones are 

• p implies q, 

• p only if q, 

• Q if P: 

• q, provided that p. 


All of them mean p q. 
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Implications play a key role in logical argument. If an implication is known to be true, 
then whenever the hypothesis is met, the consequence must be true as well. This is why an 
implication is also called a conditional statement. 

Example 2.3.1 The quadratic formula asserts that 

b 2 — Aac >0 => ax 2 + bx + c = 0 has two distinct real solutions. 

Consequently, the equation x 2 — 3a; + 1 = 0 has two distinct real solutions because its coefficients 
satisfy the inequality b 2 — Aac >0. A 

Hands-On Exercise 2.3.1 More generally, 

• lib 2 — Aac > 0, then the equation ax 2 + bx + c = 0 has two distinct real solutions. In fact, 
ax 2 + bx + c = a(x — rr)(x — r 2 ), where r\ ^ r<i are the two distinct roots. 

• If b 2 — Aac = 0, then the equation ax 2 + bx + c = 0 has only one real solution r. In such 
an event, ax 2 + bx + c = a{x — r) 2 . Consequently, we call r a repeated root. 

• lib 2 — Aac = 0, then the equation ax 2 + bx + c = 0 has no real solution. 

Use these results to determine how many solutions these equations have: 

(a) 4x 2 + 12a; + 9 = 0 

(b) 2a; 2 — 3a: — 4 = 0 

(c) a; 2 + x = — 1 


A 

Example 2.3.2 We have remarked earlier that many theorems in mathematics are in the form 
of implications. Here is an example: 

If |r| < 1, then 1 + r + r 2 + r 3 + • • • = 

It means, symbolically, |r| < 1 => 1 + r + r 2 + r 3 + • • • = j^p. A 

Hands-On Exercise 2.3.2 Express the following statement in symbol: 

If x > y > 0, then x 2 > y 2 . 


A 

Example 2.3.3 If a father promises his kids, “If tomorrow is sunny, we will go to the beach,” 
the kids will take it as a true statement. Consequently, if they wake up the next morning and 
find it sunny outside, they expect they will go to the beach. The father breaks his promise 
(hence making the implication false) only when it is sunny but he does not take his kids to the 
beach. 

If it is cloudy outside the next morning, they do not know whether they will go to the beach, 
because no conclusion can be drawn from the implication (their father’s promise) if the weather 
is bad. Nonetheless, they may still go to the beach, even if it rains! Since their father does not 
contradict his promise, the implication is still true. A 

Many students are bothered by the validity of an implication even when the hypothesis is 
false. It may help if we understand how we use an implication. Assume we want to show that 
a certain statement q is true. 
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(i) First, we find a result of the form p q. If we cannot find one, we have to prove that 
p => q is true. 

(ii) Next, show that the hypothesis p is fulfilled. 

(iii) These two steps together allow us to draw the conclusion that q must be true. 

Consequently, if p is false, we are not expected to use the implication p => q at all. Since we are 
not are going to use it, we can define its truth value to anything we like. Nonetheless, we have 
to maintain consistency with other logical connectives. We will give a justification of our choice 
at the end of the next section. 

Example 2.3.4 To show that “if x = 2, then x 2 = 4” is true, we need not worry about those 
z-values that are not equal to 2, because the implication is immediately true if x ^ 2. It suffices 
to assume that x = 2, and try to prove that we will get x 2 = 4. Since we do have x 2 = 4 when 
x = 2, the validity of the implication is established. 

In contrast, to determine whether the implication “if x 2 = 4, then x = 2” is true, we assume 
x 2 = 4, and try to determine whether x must be 2. Since x = — 2 makes x 2 = 4 true but x = 2 
false, the implication is false. 

In general, to disprove an implication, it suffices to find a counterexample that makes the 
hypothesis true and the conclusion false. A 

Hands-On Exercise 2.3.3 Determine whether these two statements are true or false: 

(a) If ( x — 2)(x — 3) = 0, then x = 2. 

(b) If x = 2, then {x — 2)(x — 3) = 0. 

Explain. 


A 

Example 2.3.5 Although we said examples can be used to disprove a claim, examples alone 
can never be used as proofs. If you are asked to show that 

if x > 2, then x 2 > 4, 

you cannot prove it by checking just a few values of x , because you may find a counterexample 
after trying a few more calculations. Therefore, examples are only for illustrative purposes, they 
are not acceptable as proofs. A 

Example 2.3.6 The statement 

“If a triangle PQR is isosceles, then two of its angles have equal measure.” 
takes the form of an implication p => q, where 

p : The triangle PQR is isosceles 

q : Two of the angles of the triangle PQR have equal measure 

In this example, we have to rephrase the statements p and q , because each of them should be 
a stand-alone statement. If we leave q as “two of its angles have equal measure,” it is not clear 
what “its” is referring to. In addition, it is a good habit to spell out the details. It helps us 
focus our attention on what we are investigating. A 

Example 2.3.7 The statement 


We can use a 
counterexample to 
disprove a claim. 


Examples cannot 
be used as proofs. 


A square must also be a parallelogram. 
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can be expressed as an implication: “if the quadrilateral PQRS is a square, then the quadrilat- 
eral PQRS is a parallelogram.” 

Likewise, the statement 

“All isosceles triangles have two equal angles.” 

can be rephrased as “if the triangle PQR is isosceles, then the triangle PQR has two equal 
angles.” Since we have expressed the statement in the form of an implication, we no longer need 
to include the word “all.” A 


Hands-On Exercise 2.3.4 Rewrite each of these logical statements: 

(a) Any square is also a parallelogram. 


(b) A prime number is an integer. 


(c) All polynomials are differentiable. 


as an implication p => < 7 . Specify what p and q are. A 

Example 2.3.8 What does “p unless q" translate into, logically speaking? We know that p is 
true, provided that q does not happen. It means, in symbol, q =>• p. Therefore, 

The quadrilateral PQRS is not a square 
unless the quadrilateral PQRS is a parallelogram 


is the same as saying 


If a quadrilateral PQRS is not a parallelogram, 
then the quadrilateral PQRS is not a square. 

Equivalently, “p unless (/” means p => q, because q is a necessary condition that prevents p from 
happening. A 

Given an implication p => q 1 we define three related implications: 

• Its converse is defined as q => p. 

• Its inverse is defined as p => q. 

• Its contrapositive is defined as q =>■ p. 

Among them, the contrapositive q =>• p is the most important one. We shall study it again in 
the next section. 

Example 2.3.9 The converse, inverse, and contrapositive of “x > 2 => x 2 > 4” are listed 
below. 

converse: x 2 > 4 => x > 2, 

inverse: x < 2 => x 2 < 4, 

contrapositive: x 2 < 4 => x <2. 

We can change the notation when we negate a statement. If it is appropriate, we may even 
rephrase a sentence to make the negation more readable. A 
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Hands-On Exercise 2.3.5 List the converse, inverse, and contrapositive of the statement “if 
p is prime, then ^ [p is irrational.” 


A 

The inverse of an implication is seldom used in mathematics, so we will only study the truth 
values of the converse and contrapositive. 


p 

q 

p => q 

q=>p 

q 

P 

q^p 

T 

T 

T 

T 

F 

F 

T 

T 

F 

F 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

T 

F 

F 

T 

T 

T 

T 

T 


An implication and its contrapositive always have the same truth value, but this is not true for 
the converse. What this means is, even though we know p => q is true, there is no guarantee 
that q =t> p is also true. This is an important observation, especially when we have a theorem 
stated in the form of an implication. So let us say it again: 

The converse of a theorem in the form of an implication may not be true. 

Accordingly, if you only know that p => q is true, do not assume that its converse q => p is also 
true. Likewise, if you are asked to prove that p => q is true, do not attempt to prove q =$■ p, 
because these two implications are not the same. 

Example 2.3.10 We know that p => q does not necessarily mean we also have q => p. This 
important observation explains the invalidity of the “proof” of 21 = 6 in Example 1.4.3. 


21 = 

6 

6 = 

21 

27 = 

27 


The argument we use here consists of three equations, but they are not individual unrelated 
equations. They are connected by implication. 



21 = 

6 

=> 

6 = 

21 

=> 

27 = 

27 


Since implications are not reversible, even though we do have 27 = 27, we cannot use this fact 
to prove that 21 = 6. After all, an implication is true if its hypothesis is false. Therefore, having 
a true implication does not mean that its hypothesis must be true. In this example, the logic is 
sound, but it does not prove that 21 = 6. ▲ 

There are two other ways to describe an implication p => q in words. They are completely 
different from the ones we have seen thus far. They focus on whether we can tell one of the two 
components p and q is true or false if we know the truth value of the other. 
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• p is a sufficient condition for q 

• q is a necessary condition for p. 

They are difficult to remember, and can be easily confused. You may want to visualize it 
pictorially: 


sufficient condition =>■ necessary condition. 


The idea is, assuming that p => q is true, then 

• For q to be true, it is enough to know or show that p is true. Hence, knowing p is true 
alone is sufficient for us to draw the conclusion the q must also be true. 

• For p to be true, it is necessary to have q be true as well. Thus, knowing q is true does 
not necessarily mean that p must be true. 


Example 2.3.11 Consider the implication 


x = 1 => x 2 = 1. 

If x = 1, we must have x 2 = 1. So, knowing x = 1 is enough for us to conclude that x 2 = 1. We 
say that x = 1 is a sufficient condition for x 2 = 1. 

If x = 1, it is necessarily true that x 2 = 1, because, for example, it is impossible to have 
x 2 = 2. Nonetheless, knowing x 2 = 1 alone is not enough for us to decide whether x = 1, 
because x can be — 1. Therefore, x 2 = 1 is not a sufficient condition for x = 1. Instead, x 2 = 1 
is only a necessary condition for x = 1. A 


Hands-On Exercise 2.3.6 Write these statements: 

(a) For x 2 > 1, it is sufficient that x > 1. 

(b) For x 2 > 1, it is necessary that x > 1. 

in the form of p => q. Be sure to specify what p and q are. 


A 


Summary and Review 

• An implication p =>■ q is false only when p is true and q is false. 

• This is how we typically use an implication. Assume we want to show that q is true. We 
have to find or prove a theorem that says p => q. Next, we need to show that hypothesis 
p is met, hence it follows that q must be true. 

• An implication can be described in several other ways. Can you name a few of them? 

• Converse, inverse, and contrapositive are obtained from an implication by switching the 
hypothesis and the consequence, sometimes together with negation. 

• In an implication p =$■ q, the component p is called the sufficient condition, and the 
component q is called the necessary condition. 
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Exercises 2.3 

1. Let p, q , and r represent the following statements: 

p: Sam had pizza last night. 

q: Chris finished her homework, 

r: Pat watched the news this morning. 

Give a formula (using appropriate symbols) for each of these statements: 

(a) If Sam had pizza last night then Chris finished her homework. 

(b) Pat watched the news this morning only if Sam had pizza last night. 

(c) Chris finished her homework if Sam did not have pizza last night. 

(d) It is not the case that if Sam had pizza last night, then Pat watched the news this 
morning. 

(e) Sam did not have pizza last night and Chris finished her homework implies that Pat 
watched the news this morning. 

2. Define the propositional variables as in Problem 1. Express in words the statements 
represented by the following formulas. 

(a) q =>■ r 

(b) p=>{qAr) 

(c) p=>(q\Jr) 

(d) r => {jp V q) 

3. Consider the following statements: 

p: Niagara Falls is in New York. 

q: New York City is the state capital of New York. 

r: New York City will have more than 40 inches of snow in 2525. 

The statement p is true, and the statement q is false. Represent each of the following 
statements by a formula. What is their truth value if r is true? What if r is false? 

(a) If Niagara Falls is in New York, then New York City is the state capital of New York. 

(b) Niagara Falls is in New York only if New York City will have more than 40 inches of 
snow in 2525. 

(c) Niagara Falls is in New York or New York City is the state capital of New York 
implies that New York City will have more than 40 inches of snow in 2525. 

(d) For New York City to be the state capital of New York, it is necessary that New York 
City will have more than 40 inches of snow in 2525. 

(e) For Niagara Falls to be in New York, it is sufficient that New York City will have 
more than 40 inches of snow in 2525. 

4. Express each of the following compound statements symbolically: 

(a) If the triangle ABC is equilateral, then it is isosceles. 

(b) If -\/47089 is greater than 200 and -\/47089 is an integer, then -\/47089 is prime. Recall that Z means 

(c) If -s/47089 is greater than 200, then, if -\/47089 is prime, it is greater than 210. the set of all integers. 

(d) The line Li is perpendicular to the line L 2 and the line L 2 is parallel to the line L 3 
implies that L i is perpendicular to L 3 . 

(e) If x 3 — 3a; 2 + x — 3 = 0, then either x is positive or x is negative or x = 0. 

5. Express each of the following compound statements in symbols. 

(a) x 3 — 3x 2 + x — 3 = 0 only if x = 3. 

(b) A necessary condition for a: 3 — 3a: 2 + a; — 3 = 0isa; = 3. 
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(c) A sufficient condition for x 3 — 3x 2 + x — 3 = 0 is £ = 3. 

(d) If e 7r is a real number, then e 7r is either rational or irrational. 

(e) All NFL players are huge. 

6. Find the converse, inverse, and contrapositive of the following implication: 

If the quadrilateral ABCD is a rectangle, then ABCD is a parallelogram. 

7. Construct the truth tables for the following expressions: 

(a) (p A g) V r (b) (p V q) => (p A r) 


Hint: To help you get started, fill in the blanks. 


p 

q 

r 

pAg 

(p A g) V r 

T 

T 

T 



T 

T 

F 



T 

F 

T 



T 

F 

F 



F 

T 

T 



F 

T 

F 



F 

F 

T 



F 

F 

F 





8. Construct the truth tables for the following expressions: 

(a) (p => q) V (p => q) (b) (p => q) A (p => q) 

9. Determine (you may use a truth table) the truth value of p if 

(a) (pAq) =» (gV r) is false (b) (gAr) => (pA q) is false 

10. Assume p q is true. 

(a) If p is true, must q be true? Explain. 

(b) If p is false, must q be true? Explain. 

(c) If q is true, must p be false? Explain. 

(d) If q if false, must p be false? Explain. 


2.4 Biconditional Statements 


The biconditional statement “p if and only if q” denoted p <t=> q, is true when both p and q 
carry the same truth value, and is false otherwise. It is sometimes abbreviated as “p iff q.” Its 
truth table is depicted below. 


P 

q 

po q 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 


Example 2.4.1 The following biconditional statements 

• 2x — 5 = 0 aax = 5/2, 

• x>yO-x — y> 0, 







2.4 Biconditional Statements 


25 


are true, because, in both examples, the two statements joined by <t=> are true or false simulta- 
neously. A 

A biconditional statement can also be defined as the compound statement 

(p=>q) A {q^p)- 

This explains why we call it a biconditional statement. A biconditional statement is often used 
to define a new concept. 

Example 2.4.2 A number is even if and only if it is a multiple of 2. Mathematically, this 
means 

n is even 4$ n = 2q for some integer q. 

It follows that for any integer to, 

mn = to • 2q = 2 (mq). 

Since mq is an integer (because it is a product of two integers), by definition, mn is even. This 
shows that the product of any integer with an even integer is always even. A 

Hands-On Exercise 2.4.1 Complete the following statement: 

n is odd -£=> 

Use this to prove that if n is odd, then n 2 is also odd. 


A 

Example 2.4.3 The operation “exclusive or” can be defined as 

pVq <t=> (pVg)A(pAg). 

See Problem 10 in Exercises 2.2. A 

When we have a complex statement involving more than one logical operation, care must be 
taken to determine which operation should be carried out first. The precedence or priority is 
listed below. 


Connectives 

Priority 

— 1 

Highest 

A 


V 


=> 


<S=> 

Lowest 


This is the order in which the operations should be carried out if the logical expression is read 
from left to right. To override the precedence, use parentheses. 

Example 2.4.4 The precedence of logical operations can be compared to those of arithmetic 
operations. 
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Operations 

Priority 

— (Negative) 

Highest 

Exponentiation 


Multiplication/Division 


Addition/Subtraction 

Lowest 


For example, yz 3 ^ (yz) 3 . To evaluate yz 3 , we have to perform exponentiation first. Hence, 
yz~ 3 = y-z~' 3 = ^. 

Another example: the notation x 2 means x raised to the power of 2 3 , hence x 2 = x 8 -, it 
should not be interpreted as (a; 2 ) 3 , because (a; 2 ) 3 = a: 6 . ▲ 

Example 2.4.5 It is not true that p <t=> q can be written as “p => q A q => p,” because it would 
mean, technically, 

P => {q A q) => p. 

The correct notation is (p q) A (q => p) . ▲ 

Hands-On Exercise 2.4.2 Insert parentheses in the following formula 

p => q A r 

to identify the proper procedure for evaluating its truth value. Construct its truth table. 


A 


Hands-On Exercise 2.4.3 Insert parentheses in the following formula 

p A q <t=> p V q. 

to identify the proper procedure for evaluating its truth value. Construct its truth table. 


A 


We close this section with a justification of our choice in the truth value of p => q when p is 
false. The truth value of p => q is obvious when p is true. 


P 

q 

q 

T 

T 

T 

T 

F 

F 

F 

T 

? 

F 

F 

? 


We want to decide what are the best choices for the two missing values so that they are consistent 
with the other logical connectives. Observe that if p => q is true, and q is false, then p must be 
false as well, because if p were true, with q being false, then the implication p => q would have 
been false. For instance, if we promise 
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“If tomorrow is sunny, we will go to the beach” 

but we do not go to the beach tomorrow, then we know tomorrow must not be sunny. This 
means the two statements p => q and q => p should share the same truth value. 

When both p and q are false, then both p and q are true. Hence q => p should be true, 
consequently so is p => q. Thus far, we have the following partially completed truth table: 


p 

q 

p=> q 

T 

T 

T 

T 

F 

F 

F 

T 

? 

F 

F 

T 


If the last missing entry is F, the resulting truth table would be identical to that of p <t=> q. To 
distinguish p <=> q from p => g, we have to define p => q to be true in this case. 

Summary and Review 

• A biconditional statement p <=> q is the combination of the two implications p q and 
<? =>P- 

• The biconditional statement p •o- q is true when both p and q have the same truth value, 
and is false otherwise. 

• A biconditional statement is often used in defining a notation or a mathematical concept. 

Exercises 2.4 

1. Let p, g, and r represent the following statements: 

p: Sam had pizza last night. 

q: Chris finished her homework. 

r: Pat watched the news this morning. 

Give a formula (using appropriate symbols) for each of these statements. 

(a) Sam had pizza last night if and only if Chris finished her homework. 

(b) Pat watched the news this morning iff Sam did not have pizza last night. 

(c) Pat watched the news this morning if and only if Chris finished her homework and 
Sam did not have pizza last night. 

(d) In order for Pat to watch the news this morning, it is necessary and sufficient that 
Sam had pizza last night and Chris finished her homework. 

2. Define the propositional variables as in Problem 1. Express in words the statements 
represented by the following formulas: 

(a) q <t=> r 
(c) p <t=> (gV r) 

3. Consider the following statements: 

p: Niagara Falls is in New York. 
q: New York City is the state capital of New York, 
r: New York City will have more than 40 inches of snow in 2525. 


(b) p (q A r) 
(d) r <t=> (p V q) 


The statement p is true, and the statement q is false. Represent each of the following 
statements by a formula. What is their truth value if r is true? What if r is false? 
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(a) Niagara Falls is in New York if and only if New York City is the state capital of New 
York. 

(b) Niagara Falls is in New York iff New York City will have more than 40 inches of snow 
in 2525. 

(c) Niagara Falls is in New York or New York City is the state capital of New York if 
and only if New York City will have more than 40 inches of snow in 2525. 

4. Express each of the following compound statements symbolically: 

(a) The product xy = 0 if and only if either x = 0 or y = 0. 

(b) The integer n = 4 if and only if 7n — 5 = 23. 

(c) A necessary condition for x = 2 is x 4 — x 2 — 12 = 0. 

(d) A sufficient condition for x = 2 is x 4 — x 2 — 12 = 0. 

(e) For x 4 — x 2 — 12 = 0, it is both sufficient and necessary to have x = 2. 

(f) The sum of squares x 2 + y 2 > 1 iff both x and y are greater than 1. 

5. Determine the truth values of the following statements (assuming that x and y are real 
numbers): 

(a) The product xy = 0 if and only if either x = 0 or y = 0. 

(b) The sum of squares x 2 + y 2 > 1 iff both x and y are greater than 1. 

(c) x 2 — 4x + 3 — 0 <t=> x = 3. 

(d) x 2 > y 2 <t=> x > y. 

6. Determine the truth values of the following statements (assuming that x and y are real 
numbers) : 

(a) u is a vowel if and only if & is a consonant. 

(b) x 2 + y 2 = 0 if and only if x = 0 and y = 0. 

(c) x 2 — 4x + 4 = 0 if and only if x = 2. 

(d) xy ^ 0 if and only if x and y are both positive. 

7. We have seen that a number n is even if and only if n = 2q for some integer q. Accordingly, 
what can you say about an odd number? 

8. We also say that an integer n is even if it is divisible by 2, hence it can be written as 
n = 2q for some integer q, where q represents the quotient when n is divided by 2. Thus, 
n is even if it is a multiple of 2. What if the integer n is a multiple of 3? What form must 
it take? What if n is not a multiple of 3? 

2.5 Logical Equivalences 

A tautology is a proposition that is always true, regardless of the truth values of the propo- 
sitional variables it contains. A proposition that is always false is called a contradiction. A 
proposition that is neither a tautology nor a contradiction is called a contingency . 

Example 2.5.1 From the following truth table 


p 

P 

pV p 

pAp 

T 

F 

T 

F 

F 

T 

T 

F 


we gather that p V p is a tautology, and p A p is a contradiction. 

In words, pVp says that either the statement p is true, or the statement p is true (that is, p 
is false). This claim is always true. 

The compound statement p t\p claims that p is true, and at the same time, p is also true 
(which means p is false). This is clearly impossible. Hence, p !\p must be false. ▲ 
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Example 2.5.2 Show that {p => q) (q => p) is a tautology. 
Solution: We can use a truth table to verify the claim. 


p 

q 

p=> q 

q 

P 

<7 => P 

(p => q) <=> (q => p) 

T 

T 

T 

T 

F 

F 

T 

F 

T 

T 

F 

T 

T 

T 

F 

F 

T 

T 

T 

T 

T 


Note how we work on each component of the compound statement separately before putting 
them together to obtain the final answer. A 

Example 2.5.3 Show that the argument 

“If p and g, then r. Therefore, if not r, then not p or not q. v 
is valid. In other words, show that the logic used in the argument is correct. 

Solution: Symbolically, the argument says 

[(PA«) => r] => [f=> (pVq)]. (2.1) 

We want to show that it is a tautology. It is easy to verify with a truth table. We can also argue 
that this compound statement is always true by showing that it can never be false. 

Suppose, on the contrary, that (2.1) is false for some choices of p, q , and r. Then 

(p A q) => r must be true, and f => (p V q) must be false. 

For the second implication to be false, we need 

r to be true, and pVg to be false. 

They in turn imply that r is false, and both p and q are false; hence both p and q are true. This 
would make (p A q) =$■ r false, contradicting the assumption that it is true. Thus, (2.1) cannot 
be false, it must be a tautology. A 

Hands-On Exercise 2.5.1 Use a truth table to show that 

[(P Ag)=tr]=t[r=t(pV q)] 


is a tautology. 

Solution: We need eight combinations of truth values in p 1 q , and r. We list the truth values 
according to the following convention. In the first column for the truth values of p , fill the upper 
half with T and the lower half with F. In the next column for the truth values of q , repeat the 
same pattern, separately, with the upper half and the lower half. So we split the upper half of 
the second column into two halves, fill the top half with T and the lower half with F. Likewise, 
split the lower half of the second column into two halves, fill the top half with T and the lower 
half with F. Repeat the same pattern with the third column for the truth values of r, and so on 
if we have more propositional variables. 

Complete the following table: 


V 

<? 

V 

pAq 

(p A q) => r 

r 

V 

q 

p v? 

r => (pV q) 

[(p A q) => r] => [r => [p V q)} 

T 

T 

T 









T 

T 

F 









T 

F 

T 









T 

F 

F 









F 

T 

T 









F 

T 

F 









F 

F 

T 









F 

F 

F 
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Do not write p = q; 
instead, write p = q. 


Question: If there are four propositional variables in a proposition, how many rows are there in 
the truth table? A 

Definition. Two logical formulas p and q are said to be logically equivalent, denoted 

p = q, 


if p <t=> q is a tautology. <0> 

We are not saying that p is equal to q. Since p and q represent two different statements, 
they cannot be the same. What we are saying is, they always produce the same truth value, 
regardless of the truth values of the underlying propositional variables. That is why we write 
p = q instead oi p = q. 

Example 2.5.4 We have learned that 

p q = {p => q) a {q => p), 

which is the reason why we call p q a biconditional statement. ▲ 

Example 2.5.5 Use truth tables to verify the following equivalent statements. 

(a) p => q=pV q. 

(b) p A (q V r) = (p A q) V (p A r). 


Solution: The truth tables for (a) and (b) are depicted below. 


V 

q 

p=> q 

P 

pv? 

T 

T 

T 

F 

T 

T 

F 

F 

F 

F 

F 

T 

T 

T 

T 

F 

F 

T 

T 

T 


P 

q 

r 

<7 V r 

pA (gVr) 

pAq 

q A r 

(p A q) V (p A r) 

T 

T 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

T 

T 

F 

T 

T 

F 

T 

T 

T 

F 

T 

T 

T 

F 

F 

F 

F 

F 

F 

F 

F 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

F 

F 

F 

F 

F 

F 

T 

T 

F 

F 

F 

F 

F 

F 

F 

T 

F 

F 

F 

F 


Example (a) is an important result. It says that p => q is true when one of these two things 
happen: (i) when p is false, (ii) otherwise (when p is true) q must be true. ▲ 


Hands-On Exercise 2.5.2 Use truth tables to establish these logical equivalences. 


(a) p=^q = q=^p 

(b) pV p = p 

(c) p A q = p V q 

(d) p«?E(p^g)A(q=tp) 
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Solution: We have set up the table for (a), and leave the rest to you. 


p 

q 

p=> q 

q 

p 

q^p 

T 

T 





T 

F 





F 

T 





F 

F 






A 

Hands-On Exercise 2.5.3 The logical connective exclusive or, denoted pY q, means either p 
or q but not both. Consequently, 

P Y q = {p V q) A (p A q) = {p A q) V (p A q). 

Construct a truth table to verify this claim. 


A 

Properties of Logical Equivalence. Denote by T and F a tautology and a contradiction, 
respectively. We have the following properties for any propositional variables p, q, and r. 

1. Commutative properties : pV q = g V p, 

p A q = q A p. 

2. Associative properties: (pV <7) V r = pV (q V r), 

(p A q) A r = p A (q A r) . 

3. Distributive laws: pV (g A r) = (pV (7) A (p V r), 

pA (g V r) = (pAq) V (pA r). 

4. Idempotent laws: pVp = p, 

p Ap = p. 

5. .De Morgan’s laws: p\J q = p Aq, 

p A q = pV q. 

6. Laws of the excluded middle , or inverse laws: pVp = T, 

p A p = F. 


7. Identity laws: pV F = p, 

p AT = p. 


8. Domination laws: pVT = T, 

p A F = F. 
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Two VERY important 
logical equivalences. 
Memorize them! 


9. Equivalence of an implication and its contrapositive: p =>■ q = q =>■ p. 

10. Writing an implication as a disjunction: p => q = p V q. 

Be sure you memorize the last two equivalences, because we will use them frequently in the rest 
of the course. 

Remark. These properties are not easy to recall. Instead of focusing on the symbolic formulas, 
try to understand their meanings. Let us explain them in words, and compare them to similar 
operations on the real numbers, 

1. Commutative properties: In short, they say that “the order of operation does not 
matter.” It does not matter which of the two logical statements comes first, the result 
from conjunction and disjunction always produces the same truth value. Compare this to 
addition of real numbers: x + y = y + x. Subtraction is not commutative, because it is not 
always true that x — y = y — x. This explains why we have to make sure that an operation 
is commutative. 

2. Associative properties: Roughly speaking, these properties also say that “the order 
of operation does not matter.” However, there is a key difference between them and the 
commutative properties. 

• Commutative properties apply to operations on two logical statements, but associa- 
tive properties involves three logical statements. Since A and V are binary operations, 
we can only work on a pair of statements at a time. Given the three statements p 1 
q, and r, appearing in that order, which pair of statements should we operate on 
first? The answer is: it does not matter. It is the order of grouping (hence the term 
associative) that does not matter in associative properties. 

• The important consequence of the associative property is: since it does not matter 
on which pair of statements we should carry out the operation first, we can eliminate 
the parentheses and write, for example, 

p\/ q\/ r 

without worrying about any confusion. 

• Not all operations are associative. Subtraction is not associative. Given three num- 
bers 5, 7, and 4, in that order, how should we carry out two subtractions? Which 
interpretation should we use: 

(5 - 7) - 4, or 5 — (7 — 4)? 

Since they lead to different results, we have to be careful where to place the paren- 
theses. 

3. Distributive laws: When we mix two different operations on three logical statements, 
one of them has to work on a pair of statements first, forming an “inner” operation. This 
is followed by the “outer” operation to complete the compound statement. Distributive 
laws say that we can distribute the “outer” operation over the inner one. 

4. Idempotent laws: When an operation is applied to a pair of identical logical statements, 
the result is the same logical statement. Compare this to the equation x 2 = x, where x is 
a real number. It is true only when x = 0 or x = 1. But the logical equivalences pV p = p 
and p A p = p are true for all p. 

5. De Morgan’s laws: When we negate a disjunction (respectively, a conjunction), we 
have to negate the two logical statements, and change the operation from disjunction to 
conjunction (respectively, from conjunction to a disjunction). 



2.5 Logical Equivalences 


33 


6. Laws of the excluded middle , or inverse laws: Any statement is either true or false, 
hence p V p is always true. Likewise, a statement cannot be both true and false at the 
same time, hence p A p is always false. 

7. Identity laws: Compare them to the equation x - 1 = x: the value of x is unchanged after 
multiplying by 1. We call the number 1 the multiplicative identity. For logical operations, 
the identity for disjunction is F, and the identity for conjunction is T. 

8. Domination laws: Compare them to the equation x ■ 0 = 0 for real numbers: the result 

is always 0, regardless of the value x. The “zero” for disjunction is T, and the “zero” for 
conjunction is F. <£> 

Example 2.5.6 What is the negation of 2 < x < 3? Give a logical explanation as well as a 
graphical explanation. 

Solution: The inequality 2 < x < 3 means 

(2 < x) f\ (x < 3). 

Its negation, according to De Morgan’s laws, is 

(2 > a:) V (x > 3). 

The inequality 2 < x < 3 yields a closed interval. Its negation yields two open intervals. Their 
graphical representations on the real number line are depicted below. 

• • e e 

2 3 2 3 

(2 < x) A (x < 3) (2 > x) V (x > 3) 

Take note of the two endpoints 2 and 3. They change from inclusion to exclusion when we take 
negation. ▲ 

Hands-On Exercise 2.5.4 Since 0 < x < 1 means “0 < x and x < 1,” its negation should be 
“0 > x or x > 1,” which is often written as “x < 0 or x > 1.” Explain why it is inappropriate, 
and indeed incorrect, to write “0 > x > 1.” 


A 


Example 2.5.7 Expand (p A q) V (r A s). 

Solution: Compare this problem to the expansion of (x + y)(u + v). We use the distributive 
law twice to obtain 


(x + y)(u + v) = x(u + v) + y{u + v) 

= xu + xv + yu + yv. 

Let us follow the same procedure to expand (p A q) V (r A s). We need to apply the distributive 
law twice. The first time, regard (r A s) as a single statement, and distribute it over p A q. In the 
second round, distribute p and g, separately, over r A s. The complete solution is shown below. 

{p A q) V (r A s) = [p V (r A s)] A [q V (r A s)] 

= (p V r) A (p V s) A (q V r) A {q V s). 


See Example 2.2.5. 
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Wrong ways to negate 
an implication. 


How to negate an 
implication? 


We can also proceed as follows: 

(p A q) V (r A s) = [{p A g) V r] A [(p Ag) Vs] 

= (p V r) A (g V r) A (p V s) A (g V s). 

The two results are identical because A is commutative. A 

Hands-On Exercise 2.5.5 Expand (pV g) A (r V s). 


A 


Example 2.5.8 We have used a truth table to verify that 

[(p A g) => r] => [r => (p V g)] 

is a tautology. We can use the properties of logical equivalence to show that this compound 
statement is logically equivalent to T. This kind of proof is usually more difficult to follow, so 
it is a good idea to supply the explanation in each step. Here is a complete proof: 


[(p A?) =>r] => [r=> (pV q)} 


{p A q) =>• r V [f => (p V g)] 
(p A g) => r V [p V g => r] 
(p A g) => r V [{p A g) => r] 
T 


(implication as disjunction) 
(implication as disjunction) 
(De Morgan’s law) 

(inverse law) 


This is precisely what we called the left-to-right method for proving an identity (in this case, a 
logical equivalence) . A 


Example 2.5.9 Write p =$> g as a conjunction. 
Solution: It is important to remember that 

P => Q 


and 

P=> q^P=>Q 

either. Instead, since p => q=pV g, it follows from De Morgan’s law that 


p=>g = pVg=pAg. 


Alternatively, we can argue as follows. Interpret p => q as saying p => q is false. This requires 
to be true and q to be false, which translates into p A q. Thus, p => q = p A q. 

Summary and Review 

• Two logical statements are logically equivalent if they always produce the same truth 
value. 

• Consequently, p = q is same as saying p q is a tautology. 

• Beside distributive and De Morgan’s laws, remember these two equivalences as well; they 
are very helpful when dealing with implications. 

p => q = q => p and p => q = p V q. 


► ^3 
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Exercises 2.5 

1. Use a truth table to verify the De Morgan’s law pV q = p Aq. 

2. Use truth tables to verify the two associative properties. 

3. Construct a truth table for each formula below. Which ones are tautologies? 

(a) (pV q)^p 

(b) (p => q) V (p => q) 

(c) (p => <?) => r 

4. Use truth tables to verify these logical equivalences. 

(a) (p/\q)^p = p^q 

(b) (p A q) => r = p => (g V r) 

(c) (p => g) A (p => r) = p A (g V r) 

5. Use only the properties of logical equivalences to verify (b) and (c) in Problem 4. 

6. Determine whether formulas u and v are logically equivalent (you may use truth tables or 
properties of logical equivalences) . 


(a) u: (p => q) A (p => q) 

v : p 

(b) u : p =>• q 

v : q => p 

(c) m : p ^ q 

u : q p 

(d) u : (p =>■ q) => r 

v : p => (q => r) 


7. Find the converse, inverse, and contrapositive of these implications. 

(a) If triangle ABC is isosceles and contains an angle of 45 degrees, then ABC is a right 
triangle. 

(b) If quadrilateral ABCD is a square, then it is both a rectangle and a rhombus. 

(c) If quadrilateral ABCD has two sides of equal length, then it is either a rectangle or 
a rhombus. 

8. Negate the following implications: 

(a) x 2 > 0 => x > 0. 

(b) If PQRS is a square, then PQRS is a parallelogram. 

(c) If n > 1 is prime, then n + 1 is composite. 

(d) If x and y are integers such that xy > 1, then either x > 1 or y > 1. 

9. Determine whether the following formulas are true or false: 

(a) p <=> q = p^q 

(b) (p => q)V {p=>q) =p 

(c) p^ q = q=> p 

10. Determine whether the following formulas are true or false: 

(a) (p => q) r = p => (q => r) 

(b) p => (q V r) = (p => q) V (p => r) 

(c) p => (q A r) = (p =>■ g) A (p => r) 

11. Which of the following statements are equivalent to the statement “if x 2 > 0, then x > 0”? 

(a) If x > 0, then x 2 > 0. 

(b) If x < 0, then x 2 < 0. 

(c) If x 2 < 0, then x < 0. 

(d) If x 2 ^ 0, then x ^ 0. 
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12. Determine whether the following formulas are tautologies, contradictions, or neither: 

(a) (p => q) A p 

(b) (p => g) A (p A g) 

(c) (p=^q)Aq 

13. Simplify the following formulas: 

(a) p A (p A q) (b) pVq 

14. Simplify the following formulas: 

(a) (p^q)A{q=A p) (b) pAq 


(c ) P=>q 


(c) pA(pVq) 


2.6 Logical Quantifiers 


The expression 


x > 5 


is neither true nor false. In fact, we cannot even determine its truth value unless we know the 
value of x. This is an example of a propositional function, because it behaves like a function 
of x, it becomes a proposition when a specific value is assigned to x. Propositional functions are 
also called predicates . 


Example 2.6.1 Denote the propositional function “x > 5” by p(x). We often write 

p(x) : x > 5. 


It is not a proposition because its truth value is undecidable, but p{ 6), p( 3) and p(— 1) are 
propositions. A 


Example 2.6.2 Define 

q{x,y) : x + y= 1. 

Which of the following are propositions; which are not? 

(a) q(x,y) 

(b) q(x, 3) 

(c) <7(1, 1) 

(d) <?(5, —4) 

For those that are, determine their truth values. 


Solution: Both (a) and (b) are not propositions, because they contain at least one variable. 
Both (c) and (d) are propositions; q( 1, 1) is false, and q( 5, —4) is true. A 

Hands-On Exercise 2.6.1 Determine the truth values of these statements, where q{x,y) is 
defined in Example 2.6.2. 

(a) <7(5, -7) 

(b) q(- 6,7) 

(c) q(x + l,-x) 


A 
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Although a propositional function is not a proposition, we can form a proposition by means 
of quantification. The idea is to specify whether the propositional function is true for all or 
for some values that the underlying variables can take on. 

Definition. The universal quantification of p(x) is the proposition in any of the following 
forms: 


• p(x) is true for all values of x. 

• For all x, p(x). 

• For each x, p{x). 

• For every x, p{x). 

• Given any x, p(x). 

All of them are symbolically denoted by 

\/xp(x), 

which is pronounced as 

“for all x , p(x)”. 

The symbol V is called the universal quantifier , and can be extended to several variables. <£> 
Example 2.6.3 The statement 

“For any real number x, we always have x 2 > 0” 
is true. Symbolically, we can write 

ViS R (x 2 > 0), or Va: (x € R. => x 2 > 0). 


The second form is a bit wordy, but could be useful in some situations. 


A 


Example 2.6.4 The statement 

\/x el(a;>5) 


is false because x is not always greater than 5. To disprove a claim, it suffices to provide only 
one counterexample. We can use x = 4 as a counterexample. 

However, examples cannot be used to prove a universally quantified statement. Consider the 
statement 

Va; € R (x 2 > 0). 


By direct calculations, one may demonstrate that x 2 > 0 is true for many a:-values. But it does 
not prove that it is true for every x, because there may be a counterexample that we have not 
found yet. We have to use mathematical and logical argument to prove a statement of the form 
“Va ’p(x).” A 


Example 2.6.5 The statement 

“Every Discrete Mathematics student has taken Calculus I and Calculus If” 

is clearly a universally quantified proposition. To express it in a logical formula, we can use an 
implication: 

Va; (x is a Discrete Mathematics student => x has taken Calculus 1 and Calculus II) 

An alternative is to say 

Va; £ S (x has taken Calculus I and Calculus II) 

where S represents the set of all Discrete Mathematics students. Although the second form 
looks simpler, we must define what S stands for. A 


Write V, not A. 


Counterexamples can 
be used to disprove a 
claim. 


Examples alone do 
not prove a statement 
of the form “\/xp(x)”. 


See Example 2.6.3. 
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Write 3, not E. 


One example suffices 
to prove a statement 
of the form “3 xp{x).” 


Definition. The existential quantification of p(x) takes one of these forms: 

• There exists an x such that p{x). 

• For some x, p(x). 

• There is some x such that p(x). 

We write, in symbol, 

3xp(x), 

which is pronounced as 

“There exists x such that p(x)." 

The symbol 3 is called the existential quantifier. It can be extended to several variables. <C> 

Example 2.6.6 To prove that a statement of the form “3 xp(x)" is true, it suffices to find an 
example of x such that p(x) is true. Using this guideline, can you determine whether these two 
propositions 

(a) 3x € K. (a: > 5) 

(b) 3i g R (y/x = 0) 

are true? 

Solution: (a) True. For example: x = 6. 

(b) True. For example: x = 0. A 

Example 2.6.7 The proposition 

“There exists a prime number x such that x + 2 is also prime” 
is true. We call such a pair of primes twin primes. A 

Hands-On Exercise 2.6.2 Name a few more examples of twin primes. 


A 

Example 2.6.8 The proposition 

“There exists a real number x such that x > 5” 
can be expressed, symbolically, as 

3x £ ffi. (x > 5), or 3x (x £ R. A x > 5). 

Notice that in an existential quantification, we use A instead of => to specify that a: is a real 
number. A 

Hands-On Exercise 2.6.3 Determine the truth value of each of the following propositions: 
(a) For any prime number x, the number x + 1 is composite. 


(b) For any prime number x > 2, the number x + 1 is composite. 
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(c) There exists an integer k such that 2k + 1 is even. 


(d) For all integers k, the integer 2k is even. 


(e) For any real number x, if x 2 is an integer, then x is also an integer. 


A 


Hands-On Exercise 2.6.4 The proposition 

“The square of any real number is positive” 
is a universal quantification 

“For any real number x, x 2 > 0.” 


Is it true or false? 


A 

Example 2.6.9 When multiple quantifiers are present, the order in which they appear is 
important. Determine whether these two statements are true or false. 

(a) \/x g Z 3 y g R* (. xy < 1) 

(b) By G R* Va; g Z (xy < 1) 

Here, R* denotes the set of all nonzero real numbers. 

Solution: (a) To prove that the statement is true, we need to show that no matter what integer 
x we start with, we can always find a nonzero real number y such that xy < 1. For x < 0, we 
can pick y = 1, which makes xy = x < 0 < 1. For x > 0, let y = x+l , then xy = < 1. This 

concludes the proof that the first statement is true. 

(b) Let y = I . Can we find an integer x such that xy ft 1? Definitely! For example, we can set 
x = 2. This counterexample shows that the second statement is false. A 

Hands-On Exercise 2.6.5 True or false: By g RVa; g Z (xy < 1)? 


A 

Example 2.6.10 Many theorems in mathematics can be expressed as quantified statements. 
Consider 


“If x is rational and y is irrational, then x + y is irrational.” 


This is same as saying 
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Recall how to negate 
an implication. 


“Whenever x is rational and y is irrational, then x + y is irrational.” 

The keyword “whenever” suggests that we should use a universal quantifier. 

Vx, y (x is rational Ay is irrational => x + y is irrational). 

It can also be written as 

Vx € QVy Q (x + y is irrational). 

Although this form looks complicated and seems difficult to understand (primarily because it 
is quite symbolic, hence appears to be abstract and incomprehensible to many students), it 
provides an easy form for negation. See the discussion below. 

The fact that an implication can be expressed as a universally quantified statement sounds 
familiar. See Example 2.3.7. ▲ 

We shall learn several basic proof techniques in Chapter 3. Some of them require negating 
a logical statement. Since many mathematical results are stated as quantified statements, it is 
necessary for us to learn how to negate a quantification. The rule is rather simple. Interchange 
V and 3, and negate the statement that is being quantified. In other words, 

Vxp(x) = 3 xp(x), and 3 xp(x) = Mxp(x). 

If we have Vx £ Z, we only change it to 3s £ Z when we take negation. It should not be negated 
as 3x ^ Z. The reason is: we are only negating the quantification, not the membership of x. In 
symbols, we write 

Vx € Z p(x) = 3x € Z p(x). 

The negation of “3x £ Z p(x)" is obtained in a similar manner. 

Example 2.6.11 We find 

Vx € Z 3 y £ R* (xy < 1) = 3x £ Z \/y £ R* (xy > 1), 

and 

By £ R* Vx £ Z (xy < 1) = \/y £ R* 3x £ Z (xy > 1). 

Remember that we do not change the membership of x and y. ▲ 

Hands-On Exercise 2.6.6 Negate the propositions in Hands-On Exercise 2.6.3. 


A 


Example 2.6.12 The statement 

“All real numbers x satisfy x 2 > 0” 

can be written as, symbolically, Vx £ R (x 2 > 0). Its negation is 3x £ R (x 2 < 0). In words, it 
says “There exists a real number x that satisfies x 2 < 0.” ▲ 

Hands-On Exercise 2.6.7 Negate the statement 

“Every Discrete Mathematics student has taken Calculus I and Calculus II.” 


A 
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Summary and Review 

• There are two ways to quantify a propositional function: universal quantification and 
existential quantification. 

• They are written in the form of “Va ;p(x)” and “3 xp(x)" respectively. 

• To negate a quantified statement, change V to 3, and 3 to V, and then negate the statement. 

Exercises 2.6 

1. Consider these propositional functions: 


p(n): 

n is prime 

q(n ): 

n is even 

r(n): 

n > 2 


Express these formulas in words: 

(a) 3nSZ (p(n) A q(n)) 

(b) VnSZ [r(n) => p(n) V q(n)\ 

(c) 3 n £ Z \p(n) A (g(n) V r(n)] 

(d) VnSZ [(p(n) A r(n)) => g(n)] 

2. Give a formula for each of the following statements: 

(a) For every even integer n there exists an integer k such that n = 2k. 

(b) There exists a right triangle T that is an isosceles triangle. 

(c) Given any quadrilateral Q, if Q is a parallelogram and Q has two adjacent sides that 
are perpendicular, then Q is a rectangle. 

3. Determine whether these statements are true or false: 

(a) There exists an even prime integer. 

(b) There exist integers s and t such that 1 < s < t < 187 and st = 187. 

(c) There is an integer m such that both in/ 2 is an integer and, for every integer k, 
m/ (2k) is not an integer. 

(d) Given any real numbers x and y, x 2 — 2 xy + y 2 > 0. 

(e) For every integer n, there exists an integer m such that m > n 2 . 

4. Determine whether these statements are true or false: 

(a) There is a rational number x such that x 2 < 0. 

(b) There exists a number x such that for every real number y, xy = 0. 

(c) For all x £ Z, either x is even, or x is odd. 

(d) There exists a unique number x such that x 2 = 1. 

5. Find the negation (in simplest form) of each formula. 

(a) Mx < 0 My, z£R(y<z=>xy> xz) 

(b) Vx £ Z [p(x) V q(x )] 

(c) Vx, y £ K [p(x, y) => q(x, y)} 

6. Negate the following statements: 

(a) For all real numbers x, there exists an integer y such that p(x,y) implies q(x,y). 

(b) There exists a rational number x such that for all integers y , either p(x, y) or r(x,y) 
is true. 

(c) For all integers x, there exists an integer y such that if p(x, y) is true, then there 
exists an integer 2 so that q(x, y, z) is true. 


Recall that the set of 
all even integers can 
be written as 2Z. 
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7. For each statement, (i) represent it as a formula, (ii) find the negation (in simplest form) 
of this formula, and (iii) express the negation in words. 

(a) For all real numbers x and y , x + y = y + x. 

(b) For every positive real number x there exists a real number y such that y 2 = x. 

(c) There exists a real number y such that, for every integer x, 2x 2 + 1 > x 2 y. 

8. For each statement, (i) represent it as a formula, (ii) find the negation (in simplest form) 
of this formula, and (iii) express the negation in words. 

(a) There exist rational numbers x\ and X 2 such that aq < X 2 and x\ — X\ > — # 2 - 

(b) For all real numbers x and y there exists an integer z such that 2 z = x + y. 

(c) For all real numbers X\ and X 2 , if xf + Xi — 2 = + X 2 — 2, then X\ = X 2 - 

9. The easiest way to negate the proposition 

“A square must be a parallelogram” 

is to say 

“It is not true that a square must be a parallelogram.” 

Yet, it is not the same as saying 

“A square must not be a parallelogram.” 

Can you explain why? What are other ways to express its negation in words? 

10. Negate these statements: 

(a) All squared numbers are positive. 

(b) All basketball players are over 6 feet tall. 

(c) No quarterback is under 6 feet tall. 
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3.1 An Introduction to Proof Techniques 

A proof is a logical argument that verifies the validity of a statement. A good proof must 
be correct, but it also needs to be clear enough for others to understand. In the following 
sections, we want to show you how to write mathematical arguments. It takes practice to learn 
how to write mathematical proofs; you have to keep trying! We would like to start with some 
suggestions. 

1. Write at the level of your peers. A common question asked by many students is: how 
much detail should I include in a proof? One simple guideline is to write at the level that 
your peers can understand. Although you can skip the detailed computation, be sure to 
include the major steps in an argument. 

2. Use symbols and notations appropriately. Do not use mathematical symbols as 
abbreviations. For example, do not write “x is a number > 4.” Use “x is a number greater 
than 4” instead. Do not use symbols excessively either. It is often clearer if we express 
our idea in words. Finally, do not start a sentence with a symbol, as in “Suppose xy > 0. 
x and y have the same signs.” It would look better if we combine the two sentences, and 
write “Suppose xy > 0, then x and y have the same signs.” 

3. Display long and important equations separately. Make the key mathematical 
results stand out by displaying them separately on their own. Be sure to center these 
expressions. Number them if you need to refer to them later. See Examples 1.3.1 and 1.3.2. 

4. Write in complete sentences, with proper usage of grammar and punctuation. 

A proof is, after all, a piece of writing. It should conform to the usual writing rules. Use 
complete sentences, and do not forget to check the grammar and punctuation. 

5. Start with a draft. Prepare a draft. When you feel it is correct, start revising it: 
check the accuracy, remove redundancy, and simplify the sentence structure. Organize the 
argument into short paragraphs to enhance the readability of a proof. Go over the proof 
and refine it further. 

Some proofs only require direct computation. 

Example 3.1.1 Let a and b be two rational numbers such that a < b. Show that the weighted 
average | a + | b is a rational number between a and b. 

Solution: Since a and b are rational numbers, we can write a = ™ and b = | for some integers 
m, n, p, and g, where n,g/0. Then 

1 2 1 m 2 p mq + 2 np 

3 ° 3 3 n 3 q 3nq 
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is clearly a rational number because mq + 2 np and 3 np are integers, and 3 nq 0. Since a < b, 

we know b — a > 0. It follows that 

which means |a+|6>a. In a similar fashion, we also find 1 a + | b < b. Thus, 1 a + | b is a 
rational number between a and b. A 

Hands-On Exercise 3.1.1 Show that | a + | b is closer to b than to a. 

Hint: Compute the distance between a and | a + | b, and compare it to the distance between 
1 a + | b and b. 


A 

Sometimes, we can use a constructive proof when a proposition claims that certain values 
or quantities exist. 

Example 3.1.2 Prove that every positive integer can be written in the form of 2 e t for some 
nonnegative integer e and some odd integer t. 

Remark. The problem statement only says “every positive integer.” It often helps if we 
assign a name to the integer; it will make it easier to go through the discussion. Consequently, 
we customarily start a proof with the phrase “Let n be ... ” <£> 

Solution: Let n be a positive integer. Keep dividing n by 2 until an odd number t remains. 
Let e be the number of times we factor out a copy of 2. It is clear that e is nonnegative, and we 
have found n = 2 e t. A 

Hands-On Exercise 3.1.2 Express 6, 40, 32, and 15 in the form stated in Example 3.1.2. 


A 

Example 3.1.3 Given any positive integer n, show that there exist n consecutive composite 
positive integers. 

Solution: For each positive integer n, we claim that the n integers 

(n + 1)! + 2, (n + l)! + 3, ... (n + 1)! + n, (n + 1)! + (n + 1) 

are composite. Here is the reason. For each i, where 2 < i < n + 1, the integer 

(n + 1)! + i = 1-2-3 •••(* — 1 )i(i + 1) • • • (n + 1) + i 

= i [1-2-3 ••■(*-l)(* + l)--- (n + 1) + 1] 

A 


is divisible by i and greater than i, and hence is composite. 
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Hands-On Exercise 3.1.3 Construct five consecutive positive integers that are composite. 
Verify their compositeness by means of factorization. 


A 

Example 3.1.4 Let m and n be positive integers. Show that, if mn is even, then an in x n 
chessboard can be fully covered by non-overlapping dominoes. 

Remark. This time, the names m and n have already been assigned to the two positive 
integers. Thus, we can refer to them in the proof without an introduction. 

Solution: Since mn is even, one of the two integers m and n must be even. Without loss of 
generality (since the other case is similar), we may assume m, the number of rows, is even. Then 
m = 2t for some integer t. Each column can be filled with m/2 = t non-overlapping dominoes 
placed vertically. As a result, the entire chessboard can be covered with nt non-overlapping 
vertical dominoes. A 

Hands-On Exercise 3.1.4 Show that, between any two rational numbers a and b, where a < b, 
there exists another rational number. 

Hint: Try the midpoint of the interval [a, b\. 


A 

Hands-On Exercise 3.1.5 Show that, between any two rational numbers a and 6, where a < 5, 
there exists another rational number closer to b than to a. 

Hint : Use a weighted average of a and b. 


A 

Sometimes a non- constructive proof can be used to show the existence of a certain quantity 
that satisfies some conditions. We have learned two such existence theorems from calculus. 

Theorem 3.1.1 (Mean Value Theorem) Let f be a differentiable function defined over a 
closed interval [a, b] . Then there exists a number c strictly inside the open interval (a, b) such 
that /'(c) = Hb) b Z f a < ' a) ■ 

Theorem 3.1.2 (Intermediate Value Theorem) Let f be a function that is continuous over 
a closed interval [a, b]. Then f assumes all values between f(a) and f(b). In other words, for 
any value t between f(a) and f(b), there exists a number c inside [a,b] such that f(c) = t. 

Both results only guarantee the existence of a number c with some specific property; they 
do not tell us how to find this number c. Nevertheless, the Mean-Value Theorem plays a very 
important role in analysis; many of its applications are beyond the scope of this course. We 
could, however, demonstrate an application of the Intermediate Value Theorem. 

Corollary 3.1.3 Let f be a continuous function defined over a closed interval [a, b\. If /(a) 
and f(b) have opposite signs, then the equation f(x) = 0 has a solution between a and b. 
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Proof: According to the Intermediate Value Theorem, f(x) can take on any value between 
/(a) and f(b). Since they have opposite signs, 0 is a number between them. Hence, /(c) = 0 
for some number c between a and b. ■ 

Example 3.1.5 The function f(x) = 5x 3 — 2a; — 1 is a polynomial function, which is known 
to be continuous over the real numbers. Since /( 0) = —1 and /( 1) = 2, Corollary 3.1.3 implies 
that there exists a number between 0 and 1 such that 5a; 3 — 2x — 1 = 0. A 

Hands-On Exercise 3.1.6 Show that the equation l+rrcosa; = 0 has at least one real solution 
between 0 and 

Hint: No function is mentioned here, so you need to define a function, say g(x). Next, you 
need to make sure that g{x) is continuous. What else do you need to do before you can apply 
Corollary 3.1.3? 


A 


Summary and Review 

• Sometimes we can prove a statement by showing how the result can be obtained through 
a construction, and we can describe the construction in an algorithm. 

• Sometimes all we need to do is apply an existence theorem to verify the existence of a 
certain quantity. 


Exercises 3.1 

1. Show that a chessboard with 7 rows and 12 columns can be covered by non-overlapping 
dominoes. 

2. Show that there is a rational number between 1 and 5 whose distance from 5 is seven times 
as long as its distance from 1. 

3. Show that the equation x 3 — 12x + 2 = 0 has at least three real solutions. 

4. Show that if the equation (x 2 + 4) (x — 2) (3x + 5) = 0 has a real solution, the solution must 
be either x = 2ora: = — |. 

5. Show that given any rational number x, there exists an integer y such that x 2 y is an 
integer. 

Hint: Since x is rational, we can write x = — for some integers m and n, where n ^ 0. 
All you need to do is to describe y in terms of m and n. 

6. Show that given any rational number x, and any positive integer k, there exists an integer 
y such that x k y is an integer. 

7. Show that there exists an integer n such that n, n + 2 and n + 4 are all primes. 

8. Find a counterexample to the following claim: For any positive integer n, if n is prime, 
then n 2 + 4 is also prime. 
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3.2 Direct Proofs 

To show that a statement q is true, follow these steps: 

1. Either find a result that states p => g, or prove that p =>■ q is true. 

2. Show or verify that p is true. 

3. Conclude that q must be true. 

The logic is valid because if p => q is true and p is true, then q must be true. Symbolically, we 
are saying that the logical formula 

[(p => q) A p] => q 

is a tautology (we can easily verify this with a truth table). Symbolically, we present the 
argument as 

p=> q 

v 

q 

Such an argument is called modus ponens or the law of detachment. 

Example 3.2.1 The argument 

b 2 > 4 ac =>• ax 2 + bx + c = 0 has two real solutions. 
x 2 — 5x + 6 satisfies b 2 > 4ac. 

~~ x 2 — 5x + 6 = 0 has two real solutions. 

is an example of modus ponens. A 

It is clear that implications play an important role in mathematical proofs. If we have a 
sequence of implications, we could join them “head to tail” to form another implication: 

P=> q 

q =$■ r 
.'. p => r 

This is called the law of syllogism. 

Example 3.2.2 The argument 

German shepherds are dogs. 

Dogs are mammals. 

Mammals are vertebrates. 

German shepherds are vertebrates. 

is valid because of the law of syllogism. A 

The big question is, how can we prove an implication? The most basic approach is the direct 
proof : 

1. Assume p is true. 

2. Deduce from p that q is true. 

The important thing to remember is: use the information derived from p to show that q is true. 
This is how a typical direct proof may look: 

Proof: Assume p is true. Then . . . 

Because of p , we find . . . 

Therefore q is true. ■ 


Proving p => q 
a direct proof. 
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Describe your goal. 


Pay attention to 
the details! 


Example 3.2.3 Prove that if an m x n chessboard can be fully covered by non-overlapping 
dominoes, then mn must be even. 

Solution: Assume the chessboard can be covered by non-overlapping dominoes, and let t be the 
number of dominoes that cover the chessboard. Then the chessboard must contain 2 1 squares. 
Hence mn = 2 1, which means mn must be an even number. A 

Before we continue with more examples, we would like to introduce the formal definition of 
even and odd integers. 

Definition. An integer is even if it can be written as 2 q for some integer q, and odd if it can 
be written as 2q + 1 for some integer q. <0> 

We do not have to use q to denote the integer that, when multiplied by 2, produces an even 
integer. Any letter will work, provided that we mention it is an integer. For example, if n is an 
even integer, then we can write n = 2t for some integer t. The notion of even integers can be 
further generalized. 

Definition. Let m be a nonzero integer. An integer is said to be a multiple of m if it can be 
written as mq for some integer q. <0> 

We are now ready to study more examples. 

Example 3.2.4 Show that the square of an odd integer is odd. 

Solution: Let n be an odd integer. Then n = 2t + 1 for some integer t. and 

n 2 = (2 1 + l) 2 = 4f 2 + At + 1 = 2(2 1 2 + 2 1) + 1, 

where 2 1 2 + 2 1 is an integer. Hence, rc 2 is odd. A 

Hands-On Exercise 3.2.1 Let n be an integer. Show that if n is odd, then n 3 is odd. 


A 


Example 3.2.5 Show that the product of two odd integers is odd. 

Solution: Let x and y be two odd integers. We want to prove that xy is odd. Then x = 2s + 1 
and y = 2t + 1 for some integers s and t, and 

xy — (2s T l)(2f -f- 1) — 4 st -{- 2s T 2t -t- 1 = 2(2 st T s T t) T 1, 
where 2 st + s + t is an integer. Therefore, xy is odd. A 

In this proof, we need to use two different quantities s and t to describe x and y because 
they need not be the same. If we write x = 2s + 1 and y = 2s + 1, we are in effect saying that 
x = y. We have to stress that s and t are integers, because just saying x = 2s + 1 and y = 2t + 1 
does not guarantee x and y are odd. For instance, the even number 4 can be written as 2 • | + 1, 
which is of the form 2s + 1. ft is obvious that 4 is not odd. Even though we can write a number 
in the form 2s + 1, it does not necessarily mean the number must be odd, unless we know with 
certainty that s is an integer. This example illustrates the importance of paying attention to 
the details in our writing. 
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Example 3.2.6 Show that if a: 3 — lx 2 + x — 7 = 0, then x = 7. 

Solution: Assume a: 3 — 7x 2 + x — 7 = 0. Since 

a: 3 — 7a’ 2 + x — 7 = x 2 (x — 7) + (x — 7) = (a; 2 + l)(a; — 7), 

if it is equal to zero, we need either a; 2 + 1 = 0, or x — 7 = 0. Since a; 2 + 1 can never be zero, we 
must have x — 7 = 0; thus x = 7. A 

Hands-On Exercise 3.2.2 Show that if a: 3 + 6a; 2 + 12a; + 8 = 0, then x = —2. 


A 

The last example demonstrates a technique called proof by cases. There are two possibili- 
ties, namely, either (i) a; 2 + 1 = 0, or (ii) x — 7 = 0. The final conclusion is drawn after we study 
these two cases separately. 

Example 3.2.7 Show that if an integer n is not divisible by 3, then n 2 — 1 must be a multiple 
of 3. 

Remark. The letter n has been used to identify the integer of interest to us, and it appears 
in the hypothesis of the implication that we want to prove. Nonetheless, many authors would 
start their proofs with the familiar phrase “Let n be " 0 

Solution: Let n be an integer that is not divisible by 3. When it is divided by 3, the remainder 
is 1 or 2. Hence, n = 3q + 1 or n = 3q + 2 for some integer q. 

• Case 1: If n = 3q + 1 for some integer q, then 

n 2 — 1 = 9 q 2 + 6q = 3(3 q 2 + 2 q), 


where 3 q 2 + 2 q is an integer. 

• Case 2: If n = 3q + 2 for some integer q, then 

n 2 — 1 = 9q 2 + 12q + 3 = 3(3<? 2 +4 q+ 1), 
where 3 q 2 + Aq + 1 is an integer. 

In both cases, we have shown that n 2 — 1 is a multiple 3. A 

Hands-On Exercise 3.2.3 Show that n 3 + n is even for all n £ N. 


A 


Hands-On Exercise 3.2.4 


Show that n(n + 1)(2 n + 1) is divisible by 6 for all n € N. 
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Hint : One of the two integers n and n + 1 must be even, so we already know that the product 
n(n + 1)(2 n + 1) is a multiple of 2. Hence, it remains to show that it is also a multiple of 3. 
Consider three cases: n = 3q, n = 3q + 1, or n = 3q + 2, where q is an integer. 


A 

We close our discussion with two common fallacies (logical errors). The first one is the 

fallacy of the inverse or the denial of the antecedent: 

p=> q 
V 

• Q 

This in effect proves the inverse p => q, which we know is not logically equivalent to the original 
implication. Hence, this is an incorrect method for proving an implication. 

Example 3.2.8 Is the following argument 

Dictionaries are valuable. 

This book is not a dictionary. 

This book is not valuable. 

valid? Why? A 

Another common mistake is known as the fallacy of the converse or the affirmation of 
the consequence: 

p=> q 

q 

p 

This only proves the converse q => p. Since the converse is not logically equivalent to the original 
implication, this is an incorrect way to prove an implication. 

Example 3.2.9 Is this argument 

No medicine tastes good. 

This drink tastes bad. 

This must be medicine. 

a valid argument? Why? A 

Summary and Review 

• To prove an implication p =>■ q, start by assuming that p is true. Use the information from 
this assumption, together with any other known results, to show that q must also be true. 

• If necessary, you may break p into several cases pi,P 2 , ■ ■ ■ , and prove each implication 
Pi => q (separately, one at a time) as indicated above. 

• Be sure to write the mathematical expressions clearly. Use different variables if the quan- 
tities involved may not be the same. 





3.2 Direct Proofs 


51 


• To get started, write down the given information, the assumption, and what you want to 
prove. 

• In the next step, use the definition if necessary, and rewrite the information in mathe- 
matical notations. The point is, try to obtain some mathematical equations or logical 
statements that we can manipulate. 

Exercises 3.2 

1. Prove or disprove: 2 n + 1 is prime for all nonnegative integer n. 

2. Show that for any integer n > 5, the integers n, n + 2 and n + 4 cannot be all primes. 

Hint: If n is a multiple of 3, then n itself is composite, and the proof will be complete. So 
we may assume n is not divisible by 3. Then what would n look like, and, what can you 
say about n + 2 and n + 4? 

3. Let n be an integer. 

(a) Show that if n is odd, then n 2 is also odd. 

(b) Show that if n is odd, then n 4 is also odd. 

(c) A corollary is a result that can be derived easily from another result. Derive (b) as 
a corollary of (a). 

(d) Show that if m and n are odd, then so is mn. 

(e) Show that if m is even, and n is odd, then mn is even. 

4. Prove that, for any odd integer n, the number 2 n 2 + 5n + 4 must be odd. 

5. Let n be an integer. 

(a) Prove that if n is a multiple of 3, then n 2 is also a multiple of 3. 

(b) Prove that if n is a multiple of 7, then n 3 is also a multiple of 7. 

6. Prove that if n is not a multiple of 3, then n 2 is also not a multiple of 3. 

Hint : If n is not a multiple of 3, then n = 3q + 1 or n = 3q + 2 for some integer q. 

7. Use the facts that 

(i) \/2 is irrational, and 

(ii) if x is irrational, then y/x is also irrational, 
to prove that v^2 is irrational. 

8. Recall that we can use a counterexample to disprove an implication. Show that the 
following claims are false: 

(a) If x and y are integers such that x 2 > y 2 , then x > y. 

(b) If n is a positive integer, then n 2 + n + 41 is prime. 

9. Explain why the following arguments are invalid: 

(a) Let n be an integer. If n 2 is odd, then n is odd. Therefore, n must be odd. 

(b) Let n be an integer. If n is even, then n 2 is also even. As an integer, n 2 could be 
odd. Hence, n cannot be even. Therefore, n must be odd. 

10. Analyze the following reasoning: 

(a) Let S' be a set of real numbers. If x is in S, then x 2 is in S. But x is not in S, hence 
x 2 is not in S. 

(b) Let S be a set of real numbers. If x is in S, then x 2 is in S. Therefore, if x 2 is in S, 
then x is in S. 
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Proof by 
contrapositive. 


Remember to describe 
your goal. 


3.3 Indirect Proofs 

Instead of proving p => q directly, it is sometimes easier to prove it indirectly. There are two 
kinds of indirect proofs: the proof by contrapositive, and the proof by contradiction. 

The proof by contrapositive is based on the fact that an implication is equivalent to its 
contrapositive. Therefore, instead of proving p => q 1 we may prove its contrapositive q => p. 
Since it is an implication, we could use a direct proof: 

1. Assume q is true (hence, assume q is false). 

2. Show that p is true (that is, show that p is false). 

The proof may proceed as follow: 

Proof: We want to prove the contrapositive of the stated result. 

Assume q is false, . . . 


Therefore p is false. ■ 


Example 3.3.1 Let n be an integer. Show that if n 2 is even, then n is also even. 

Solution: Proof by contrapositive: We want to prove that if n is odd, then n 2 is odd. If n is 
odd, then n = 2t + 1 for some integer t. Hence, 

n 2 = 4f 2 + At + 1 = 2(2 1 2 +2t) + l 

is odd. This completes the proof. A 

Example 3.3.2 Show that if n is a positive integer such that the sum of its positive divisors 
is n + 1, then n is prime. 

Solution: We shall prove the contrapositive of the given statement. We want to prove that if 
n is composite, then the sum of its positive divisors is not n + 1. Let n be a composite number. 
Then its divisors include 1, n, and at least one other positive divisor x different from 1 and n. 
So the sum of its positive divisors is at least 1 + n + x. Since x is positive, we gather that 

1 + + a: > l+n. 

We deduce that the sum of the divisors cannot be n + 1. Therefore, if the sum of the divisors 
of n is precisely n + 1, then n must be prime. A 

Example 3.3.3 Let a: be a real number. Prove that if x 3 — 7x 2 + x — 7 = 0, then x = 7. 

Solution: Assume x 7, then 

x 3 — 7x 2 + x — 7 = x 2 (x — 7) + (x — 7) = (x 2 + l)(x — 7) 0. 

Thus, if x 3 — 7x 2 + x — 7 = 0, then x = 7. A 

Hands-On Exercise 3.3.1 Let a: be a real number. Prove that if (2a: 2 + 3)(x + 5)(x — 7) = 0, 
then either x = — 5 , or x = 7. 


A 
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Hands-On Exercise 3.3.2 Let x and y be two real numbers. Prove that if x ^ 0 and y ^ 0, 
then xy ^ 0. 


A 

Another indirect proof is the proof by contradiction. To prove that p q, we proceed as 
follows: 

1. Suppose p => q is false; that is, assume that p is true and q is false. 

2. Argue until we obtain a contradiction, which could be any result that we know is false. 

How does this prove that p =$■ ql Assuming that the logic used in every step in the argument 
is correct, yet we still end up with a contradiction, then the only possible flaw must come from 
the supposition that p =A q is false. Consequently, p => q must be true. 

This is what a typical proof by contradiction may look like: 


Proof: Suppose p => q is false. Then p is true and q is false. Then 

. . . , which is a contradiction. Therefore, p => q must be true. I 

There is a more general form for proving a statement r, which needs not be an implication. 
To prove the proposition r by contradiction, we follow these steps: 

1. Suppose r is false. 

2. Argue until we obtain a contradiction. 


Proof: Suppose r is false. Then . . . 


. . . , which is a contradiction. Therefore, r must be true. ■ 


Example 3.3.4 Show that if a: 3 — 7x 2 + x — 7 = 0, then x = 7. 

Solution: Assume a: 3 — 7x 2 + x — 7 = 0, we want to show that x = 7. Suppose x ^ 7, then 
x — 7 0, and 


0 = a: 3 — 7x 2 + x — 7 = x 2 (x — 7) + (x — 7) = (x 2 + l)(x — 7) 

would have implied that x 2 + 1 = 0, which is impossible. Therefore, we must have x = 7. A 

Example 3.3.5 Show that if P is a point not on a line L, then there exists exactly one 
perpendicular line from P onto L. 

Solution: Suppose we can find more than one perpendicular line from P onto L. Pick any two 
of them, and denote their intersections with L as Q and R. Then we have a triangle PQR, 
where the angles PQR and PRQ are both 90° . This implies that the sum of the interior angles 


Proof by 
contradiction. 


Another form of proof 
by contradiction. 
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of the triangle PQR exceeds 180°, which is impossible. Hence, there is only one perpendicular 
line from P onto L. A 

Example 3.3.6 Show that if x 2 < 5, then \x\ < a/5. 

Solution: Assume x 2 < 5, we want to show that \x\ < \/E. Suppose, on the contrary, we have 
\x\ > \/5. Then either x > VE, or x < — \/E. If x > VE, then x 2 > 5. If x < — \/5, we again 
have x 2 > 5. In either case, we have a contradiction. Hence |rc| < \/E. A 

Hands-On Exercise 3.3.3 Prove that if x 2 > 49, then |a;| > 7. 


A 


Example 3.3.7 Prove that the logical formula 

[( P => q) A p\ =>■ q 


is a tautology. 

Solution: Suppose \(p => q) A p] q is false for some statements p and q. Then we find 

• {p o) A p is true, and 

• q is false. 

For the conjunction (p =>■ q) A p to be true, we need 

• p => q to be true, and 

• p to be true. 

Having p true and q false would make p =>• q false. This directly contradicts what we have found. 
Therefore, the logical formula [(p q) A p] q is always true, hence it is a tautology. A 

Example 3.3.8 Prove, by contradiction, that if x is rational and y is irrational, then x + y is 
irrational. 


Solution: Let x be a rational number and y an irrational number. We want to show that x + y 
is irrational. Suppose, on the contrary, that x + y is rational. Then 

TO 

x + y = — 
n 

for some integers to and n, where n ^ 0. Since x is rational, we also have 



q 


for some integers p and q , where <7^0. It follows that 


TO 

n 


x + y 


P 

q 


+ y- 


m. p mq — np 

n q nq 


Hence, 
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where mq — np and nq are both integers, with nq ^ 0. This makes y rational, which contradicts 
the assumption that y is irrational. Thus, x + y cannot be rational, it must be irrational. A 

Hands-On Exercise 3.3.4 Prove that 

Vx + y + 

for any positive real numbers x and y. 

Hint. The words “for any” suggest this is a universal quantification. Be sure you negate the 
problem statement properly. 


A 


Example 3.3.9 Prove that \/2 is irrational. 

Solution: Suppose, on the contrary, y/2 is rational. Then we can write 

y/2 = — 
n 

for some positive integers m and n such that m and n do not share any common divisor except 
1 (hence ^ is in its simplest term). Squaring both sides and cross-multiplying yields 

2 n 2 = m 2 . 

Thus, 2 divides m 2 . Consequently, 2 must also divide m. Then we can write m = 2s for some 
integer s. The equation above becomes 

2 n 2 = m 2 = (2s) 2 = 4s 2 . 


Hence, 


n 2 = 2s 2 , 


which implies that 2 divides n 2 ; thus, 2 also divides n. We have proved that both m and n are 
divisible by 2. This contradicts the assumption that m and n do not share any common divisor. 
Therefore, y/2 must be irrational. A 

Hands-On Exercise 3.3.5 Prove that y/3 is irrational. 


A 
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Very often, a proof by contradiction can be rephrased into a proof by contrapositive or even 
a direct proof, both of which are easier to follow. If this is the case, rewrite the proof. 


Example 3.3.10 Show that x 2 + 4x + 6 = 0 has no real solution. In symbols, show that 
$x € K. ( x 2 + 4x + 6 = 0). 

Solution: Consider the following proof by contradiction: 


Suppose there exists a real number x such that x 2 + 4x + 6 = 0. Using calculus, 
it can be shown that the function f(x) = x 2 + 4x + 6 has an absolute minimum 
at x = —2. Thus, f(x) > /(— 2) = 2 for any x. This contradicts the assumption 
that there exists an x such that x 2 + 4x + 6 = 0. Thus, x 2 + 4x + 6 = 0 has no 
real solution. 


A close inspection reveals that we do not really need a proof by contradiction. The crux of the 
proof is the fact that x 2 + 4x + 6 > 2 for all x. This already shows that x 2 + 4x + 6 could never 
be zero. It is easier to use a direct proof, as follows. 


Using calculus, we find that the function f(x) = x 2 + 4x + 6 has an absolute 
minimum at x = —2. Therefore, for any x, we always have f{x) > /(— 2) = 2. 
Hence, there does not exist any x such that x 2 + 4x + 6 = 0. 


Do you agree that the second proof (the direct proof) is more elegant? 


A 


Recall that a biconditional statement p -o- q consists of two implications p => q and q => p. 
Hence, to prove p <=> q, we need to establish these two “directions” separately. 


Example 3.3.11 


Let n be an integer. Prove that n 2 is even if and only if n is even. 


Solution: (=>) We first prove that if n 2 is even, then n must be even. We shall prove its 
contrapositive: if n is odd, then n 2 is odd. If n is odd, then we can write n — 2t + 1 for some 
integer t. Then 

n 2 = (2 1 + 1) = 4t 2 + 4t + 1 = 2(2 1 2 + 2 1) + 1, 
where 2 1 2 + 2 1. is an integer. Thus, n 2 is odd. 

(*^=) Next, we prove that if n is even, then n 2 is even. If n is even, we can write n = 2t for 
some integer t. Then 

n 2 = (2 1) 2 = 4t 2 = 2 • 2 1 2 , 

where 2 1 2 is an integer. Hence, n 2 is even, which completes the proof. A 

Hands-On Exercise 3.3.6 Let n be an integer. Prove that n is odd if and only if n 2 is odd. 


A 
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Summary and Review 

• We can use indirect proofs to prove an implication. 

• There are two kinds of indirect proofs: proof by contrapositive and proof by contradiction. 

• In a proof by contrapositive, we actually use a direct proof to prove the contrapositive of 
the original implication. 

• In a proof by contradiction, we start with the supposition that the implication is false, 
and use this assumption to derive a contradiction. This would prove that the implication 
must be true. 

• A proof by contradiction can also be used to prove a statement that is not of the form 
of an implication. We start with the supposition that the statement is false, and use this 
assumption to derive a contradiction. This would prove that the statement must be true. 

• Sometimes a proof by contradiction can be rewritten as a proof by contrapositive or even 
a direct proof. If this is true, rewrite the proof. 

Exercises 3.3 

1. Let n be an integer. Prove that if n 2 is even, then n must be even. Use 

(a) A proof by contrapositive. 

(b) A proof by contradiction. 

Remark : The two proofs are very similar, but the wording is slightly different, so be sure 
you present your proofs clearly. 

2. Let n be an integer. Show that if n 2 is a multiple of 3, then n must also be a multiple of 

3. Use 

(a) A proof by contrapositive. 

(b) A proof by contradiction. 

3. Let n be an integer. Prove that if n is even, then n 2 = 4s for some integer s. 

4. Let m and n be integers. Show that mn = 1 implies that m = 1 or m = —1. 

5. Let i be a real number. Prove by contrapositive: if x is irrational, then y/x is irrational. 
Apply this result to show that \f2 is irrational, using the assumption that \/2 is irrational. 

6. Let x and y be real numbers such that x ^ 0. Prove that if x is rational, and y is irrational, 
then xy is irrational. 

7. Prove that -\/5 is irrational. 

8. Prove that \f2 is irrational. 

9. Let a and b be real numbers. Show that if a ^ b, then a 2 + b 2 ^ 2 ab. 

10. Use contradiction to prove that, for all integers k > 1, 

2\/k + 1 A — A 2\J k -\- 2. 

Vk+1 

11. Let m and n be integers. Show that mn is even if and only if m is even or n is even. 

12. Let x and y be real numbers. Show that x 2 + y 2 = 0 if and only if x = 0 and y = 0. 

13. Prove that, if x is a real number such that 0 < x < 1, then a:(l — x) < \. 

14. Let m and n be positive integers such that 3 divides mn. Show that 3 divides to, or 3 
divides n. 



58 


Chapter 3 Proof Techniques 


15. Prove that the logical formula 

(p => q) V (p => q) 

is a tautology. 

16. Prove that the logical formula 


{(p => g) A (p => g)] => p 


is a tautology. 


3.4 Mathematical Induction: An Introduction 


Mathematical induction can be used to prove that an identity is valid for all integers n > 1. 
Here is a typical example of such an identity: 




n(n + 1) 
2 


More generally, we can use mathematical induction to prove that a propositional function P(n) 
is true for all integers n > 1. 


Mathematical Induction. To show that a propositional function P(n) is true for all integers 
n > 1, follow these steps: 

1. Basis Step: Verify that P(l) is true. 

2. Inductive Step: Show that if P{k) is true for some integer k > 1, then P(fc + 1) is also 
true. 

The basis step is also called the anchor step or the initial step. This proof technique is valid 
because of the next theorem. 

Theorem 3.4.1 (Principle of Mathematical Induction) If S C N such that 

(i) 1 £ S, and 

(ii) k€S^k+lGS, 

then S = N. 


Remark. Here is a sketch of the proof. From (i), we know that 1 £ S. It then follows from 
(ii) that 2 £ S. Applying (ii) again, we find that 3 € S'. Likewise, 4 £ S, then 5 £ S, and so on. 
Since this argument can go on indefinitely, we find that S = N. 

There is a subtle problem with this argument. It is unclear why “and so on” will work. After 
all, what does “and so on” or “continue in this manner” really mean? Can it really continue 
indefinitely? The trouble is, we do not have a formal definition of the natural numbers. It turns 
out that we cannot completely prove the principle of mathematical induction with just the usual 
properties for addition and multiplication. Consequently, we will take the theorem as an axiom 
without giving any formal proof. <0* 

Although we cannot provide a satisfactory proof of the principle of mathematical induction, 
we can use it to justify the validity of the mathematical induction. Let S be the set of integers 
n for which a propositional function P(n) is true. The basis step of mathematical induction 
verifies that 1 £ S. The inductive step shows that k £ S implies k + 1 £ S. Therefore, the 
principle of mathematical induction proves that S = N. It follows that P(n) is true for all 
integers n > 1. 

The basis step and the inductive step, together, prove that 

P(l) => P( 2) => P{ 3) =>•••. 
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Compare induction 
to the domino effect. 

• The first domino must fall to start the motion. If it does not fall, no chain reaction will 
occur. This is the basis step. 

• The distance between adjacent dominoes must be set up correctly. Otherwise, a certain 
domino may fall down without knocking over the next. Then the chain reaction will stop, 
and will never be completed. Maintaining the right inter-domino distance ensures that 
P(k) => P{k + 1) for each integer k > 1. 

To prove the implication 

P(k) => P(k + 1) 

in the inductive step, we need to carry out two steps: assuming that P{k) is true, then using it 
to prove P(k + 1) is also true. So we can refine an induction proof into a 3-step procedure: 

1. Verify that P( 1) is true. 

2. Assume that P(k) is true for some integer k > 1. 

3. Show that P(k + 1) is also true. 

The second step, the assumption that P(k) is true, is sometimes referred to as the inductive 
hypothesis or induction hypothesis . This is how a mathematical induction proof may look: 


Therefore, P(n) is true for all integers n > 1. Compare induction to falling dominoes. When 
the first domino falls, it knocks down the next domino. The second domino in turn knocks down 
the third domino. Eventually, all the dominoes will be knocked down. But it will not happen 
unless these conditions are met: 


Proof: We proceed by induction on n. When n = 1 , the left-hand 
side of the identity reduces to . . . , and the right-hand side becomes 
. . . . Hence, the identity holds when n = 1. Assume the identity 
holds when n = k for some integer k > 1; that is, assume 

(3.1) 

for some integer k > 1. We want to show that it also holds when 
n = k + 1; that is, we want to show that 


Using the inductive hypothesis (3.1), we find 


Therefore, the identity also holds when n = k + 1. This completes 
the induction. ■ 


The idea behind mathematical induction is rather simple. However, it must be delivered 
with precision. 

(i) Be sure to say “Assume the identity holds for some integer k > 1.” Do not say “Assume 
it holds for all integers k > 1.” If we already know the result holds for all k > 1, then 
there is no need to prove anything at all. 

(ii) Be sure to specify the requirement k > 1. This ensures that the chain reaction of the 
falling dominoes starts with the first one. 

(iii) Do not say “let n = fc” or “let n = k + 1.” The point is, you are not assigning the value 
of k and k + 1 to n. Rather, you are assuming that the statement is true when n equals 
k, and using it to show that the statement also holds when n equals k + 1. 
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Example 3.4.1 Use mathematical induction to show that 

n(n + 1) 




for all integers n > 1. 


Discussion. In the basis step, it would be easier to check the two sides of the equation separately. 
The inductive step is the key step in any induction proof, and the last part, the part that proves 
P(k + 1) is true, is the most difficult part of the entire proof. In this regard, it is helpful to 
write out exactly what the inductive hypothesis proclaims, and what we really want to prove. 
In this problem, the inductive hypothesis claims that 




k(k + 1) 


We want to prove that P(k + 1) is also true. What does P(k + 1) really mean? It says 

l + 2 + 3 + --- + (fc + l) = (fc + 1 H fc + 2) . 

Compare the left-hand sides of these two equations. The first one is the sum of k quantities, 
and the second is the sum of k + 1 quantities, and the extra quantity is the last number k + 1. 
The sum of the first k terms is precisely what we have on the left-hand side of the inductive 
hypothesis. Hence, by writing 

1 — I— 2 — |— 3 — (— • • • — I— (A; — |— 1) == 1 — |— 2 — |— • • • — |— /c — |- (A: — |— 1), 

we can regroup the right-hand side as 

1 + 2 + 3 + • • • + (fc + 1) = [1 + 2 + ■■■ + k] + (k + 1), 

so that 1 + 2 H + k can be replaced by Mfuhli, according to the inductive hypothesis. With 

additional algebraic manipulation, we try to show that the sum does equal to f fc + 1 H fc + 2 ) _ <£> 

Solution: We proceed by induction on n. When n = 1, the left-hand side of the identity reduces 
to 1, and the right-hand side becomes = 1; hence, the identity holds when n = 1. Assume it 
holds when n = k for some integer k > 1; that is, assume that 




k{k + 1) 


for some integer k > 1. We want to show that it also holds when n = k + 1. In other words, we 
want to show that 


1 + 2 + 3-I + (fc + 1) — 

Using the inductive hypothesis, we find 

1 + 2 + 3H + (£; + !) 


(k + 1) (k + 2) 


1 + 2 + 3 + • • • + fc + (fc + 1) 
k(k + 1) 

2 


+ (k + 1) 


(k + 1) ( 2 + 1 


= (k + 1) 


k 
2 

k + 2 


Therefore, the identity also holds when n = k + 1. This completes the induction. 
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We can use the summation notation (also called the sigma notation) to abbreviate a 
sum. For example, the sum in the last example can be written as 

n 

5 > 

i — 1 

The letter i is the index of summation. By putting i = 1 under ^ and n above, we declare 
that the sum starts with i = 1, and ranges through i = 2, i = 3, and so on, until i = n. The 
quantity that follows ^ describes the pattern of the terms that we are adding in the summation. 
Accordingly, 

10 

= l 2 + 2 2 + 3 2 + • • • + 10 2 . 

i = 1 

In general, the sum of the first n terms in a sequence {ai, a2, 03 , . . . } is denoted Y17=i Observe 
that 

fc+l / k \ 

^2 ai = ( ^2 ai ) + a/c+i ’ 
i=l \ i=l ) 

which provides the link between P(k + 1) and P(k) in an induction proof. 

Example 3.4.2 Use mathematical induction to show that, for all integers n > 1, 
yV = 1 2 + 2 2 + 3 2 + ,,, +n 2 = n(n+l)(2n+l) ' 


Solution : We proceed by induction on n. When n = 1, the left-hand side reduces to l 2 = 1, 
and the right-hand side becomes = 1; hence, the identity holds when n = 1. Assume it 
holds when n = k for some integer k > 1; that is, assume that 

, 2 k{k + 1)(2 k + 1) 



for some integer k > 1. 
want to show that 

fc+i 


E 


i = 1 


We want to show that it still holds when n = k + 1. In other words, we 

{k + l)(fc + 2) [2 {k + 1) + 1] {k + 1 )(k H- 2)(2 k + 3) 

= 6 = 6 ' 


From the inductive hypothesis, we find 


fc+l / k \ 

E * 2 = (^E i2 J +( fc +i ) 2 

= fc ( fc + 1 K 2fc+1 ) +(fc+1) 2 

= l(k + l)[k{2k + 1) + 6{k + 1)} 
= | {k + l)(2fc 2 + 7fc + 6) 

= ^ (fc + l)(fc + 2)(2/c + 3). 


▲ 


Therefore, the identity also holds when n = k + 1. This completes the induction. 
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Example 3.4.3 Use mathematical induction to show that 

(n + l)(5n + 6) 


3 + ^(3 + 5i) = 


for all integers n > 1. 


Solution: Proceed by induction on n. When n = 1, the left-hand side reduces to 3+(3+5) = 11, 
and the right-hand side becomes = 11; hence, the identity holds when n = 1. Assume it 
holds when n = k for some integer k > 1; that is, assume that 


3 + ^(3 + 5i) = 


(k + l)(5fc + 6) 


i=l 


for some integer k > 1. We want to show that it still holds when n = k + 1. In other words, we 
want to show that 

fc+i 


3 + ^(3 + 5 *) = 


[(k + 1) + 1] [5 (k + 1) + 6] _ (k + 2) (5k + 11) 


From the inductive hypothesis, we find 


fc+l / k \ 

3 + ^ ) ( 3 + 5 1) = I 3 + ^ " (3 + 5i) J + [3 + 5 (k -t~ 1)] 

i— 1 \ i=l ) 


(k + l)(5fc + 6) 
2 


-f- 5k -f 8 

\ [(k + l)(5fc + 6) + 2(5fc + 8)] 
| (5k 2 + 21 k + 22) 
b(k + 2)(5k + 11). 


This completes the induction. A 

Hands-On Exercise 3.4.1 It is time for you to write your own induction proof. Prove that 

1.2 + 2- 3 + 3. 4 + ... + n(n + l) = n( " + 1 ’ (n + 2) 

o 

for all integers n > 1. 

Remark. We give you a hand on this one, after which, you will be on your own. We lay out 
the template, all you need to do is fill in the blanks. <0> 

Solution: Proceed by induction on n. When n = 1, the left-hand side reduces to . . . 
and the right-hand side becomes ... . Hence, ... . As- 
sume the identity holds when n = k for ... ; that is, assume 


for some integer k > 1. We want to show that it also holds when n = k + 1; that is, we want to 
show that 


It follows from the inductive hypothesis that 
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= + 

This completes the induction. A 

Hands-On Exercise 3.4.2 Use induction to prove that, for all positive integers n, 


1-2-3 + 2- 3- 4 + -- - + n(ri T 1 )(/i -t- 2) 


n(n + l)(n + 2 )(n + 3) 
4 


A 

Use induction to prove that, for all positive integers n, 

l+4 + 4 2 + -- -+4" = - (4 n+1 - 1). 

3 


Hands-On Exercise 3.4.3 
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A 

All three steps in an induction proof must be completed; otherwise, the proof may not be 
correct. 

Example 3.4.4 Never attempt to prove P(k) =>■ P(k + 1) by examples alone. Consider 

P(n) : n 2 + n + 11 is prime. 

In the inductive step, we want to prove that 

P(k) => P(k + 1) for any k > 1. 

The following table verifies that it is true for 1 < k < 8: 


n 

1 

2 

3 

4 

5 

6 

7 

8 

9 

n 2 + n+ 11 

13 

17 

23 

31 

41 

53 

67 

83 

101 


Nonetheless, when n = 10, n 2 + n + 11 = 121 is composite. So P( 9) =£> P(10). The inductive 
step breaks down when k = 9. A 

Example 3.4.5 The basis step is equally important. Consider proving 

P(n) : 3n + 2 = 3q for some integer q 

for all n £ N. Assume P(k) is true for some integer k > 1; that is, assume 3fc + 2 = 3q for some 
integer q. Then 

3{k + 1) + 2 = 3fc + 3 + 2 = 3 + 3q = 3(1 + q). 

Therefore, 3 (k + 1) + 2 can be written in the same form. This proves that P(k + 1) is also true. 
Does it follow that P(n) is true for all integers n > 1? We know that 3 n + 2 cannot be written 
as a multiple of 3. What is the problem? 

Solution: The problem is: we need P{k) to be true for at least one value of k so as to start the 
sequence of implications 

P(1)=>P(2), P(2) => P(3), P(3) => P(4), 

The induction fails because we have not established the basis step. In fact, P(l) is false. Since 
the first domino does not fall, we cannot even start the chain reaction. A 

Remark. Thus far, we have learned how to use mathematical induction to prove identities. 
In general, we can use mathematical induction to prove a statement about n. This statement 
can take the form of an identity, an inequality, or simply a verbal statement about n. We shall 
learn more about mathematical induction in the next few sections. <0> 
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Summary and Review 

• Mathematical induction can be used to prove that a statement about n is true for all 
integers n > 1. 

• We have to complete three steps. 

• In the basis step, verify the statement for n = 1. 

• In the inductive hypothesis, assume that the statement holds when n = k for some integer 
k > 1. 

• In the inductive step, use the information gathered from the inductive hypothesis to prove 
that the statement also holds when n = k + 1. 

• Be sure to complete all three steps. 

• Pay attention to the wording. At the beginning, follow the template closely. When you 
feel comfortable with the whole process, you can start venturing out on your own. 

Exercises 3.4 

1. Use induction to prove that 

„o o n 2 (n+l) 2 

l 3 + 2 3 + 3 3 + • • • + n 3 = — - 

4 

for all integers n > 1. 

2. Use induction to prove that the following identity holds for all integers n > 1: 

I + 3 + 5 H + (2 n — 1) = n 2 . 

3. Use induction to show that 

II 1 _ 3 / 1 

1+ 3 + 32 + '" + iU _ 2V 

for all positive integers n. 

4. Use induction to establish the following identity for any integer n > 1: 

1 — (— 3')”+ 1 

1-3 + 9 + (— 3) n = ^ . 

v 4 

5. Use induction to show that, for any integer n > 1: 

n 

• i\ = (n + 1)! — 1. 

i - 1 

6. Use induction to prove the following identity for integers n > 1: 

n . 

E l n 

(2 i - l)(2z + 1) = 2n + 1 ' 


7. Evaluate 


E 


“ i{i + 1) 

t—i v 7 


for a few values of n. What do you think the result should be? Use 


induction to prove your conjecture. 


8. Use induction to prove that 


^(2 i - l) 3 = n 2 (2n 2 - 1) 


whenever n is a positive integer. 
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See Hands-On 
Exercise 3.2.4. 


9. Use induction to show that, for any integer n > 1: 


l 2 - 2 2 + 3 2 - 


+ (-l) r 


= (- 1 ) 


,_i n(n+l) 


10. Use mathematical induction to show that 


E 


i + 4 

i(i + 1) (i T 2) 


n(3n + 7) 

2 (n + 1) (n + 2) 


for all integers n > 1. 


3.5 More on Mathematical Induction 

Besides identities, we can also use mathematical induction to prove a statement about a positive 
integer n. 


Example 3.5.1 Prove that n(n + 1)(2 n + 1) is a multiple of 6 for all integers n > 1. 

Remark. We have already seen how to prove this claim using a proof by cases, which is 
actually an easier way to prove that n(n + l)(2n + 1) is divisible by 6. Nonetheless, we shall 
demonstrate below how to use induction to prove the claim. <0> 

Discussion. In the inductive hypothesis, it is clear that we are assuming k(k + 1)(2 k + 1) is a 
multiple of 6. In the inductive step, we want to prove that 

(k + 1 )(k + 2)[2(fc + 1) + 1] = (k + 1 )(fc + 2)(2fc + 3) 

is also a multiple of 6. A multiple of 6 can be written as 6q for some integer q. Since we have 
two multiples of 6, we need to write 


k(k + 1)(2 k + 1) = 6q 

and 

(k + l)(fc + 2)(2fc + 3) =6 Q 

to distinguish them. By using the lowercase and uppercase of the same letter, we indicate that 
they are different values. Yet, because they come from the same letter, they both share some 
common attribute, in this case, being the quotients when the respective values are divided by 6. 

Now, in the inductive step, we need to make use of the equation k(k + 1)(2 k + 1) = 6q from 
the inductive hypothesis. This calls for connecting the product {k + l)(fc + 2)(2 k + 3) to the 
expression k(k + 1)(2 k + 1). Since they share the common factor k + 1, what remains to do is 
write (fc + 2) (2 k + 3) in terms of k(2k + 1). 

We are asked to prove that ?r(n + l)(2n + l) is a multiple of 6. This is not an identity. There- 
fore, do not say “assume/show that the identity holds when ... .” Instead, say “assume/show 
that the claim is true when . . . .” <^> 

Solution: Proceed by induction on n. When n = 1, we have n(n + l)(2?i + 1) = 1 • 2 • 3 = 6, 
which is clearly a multiple of 6. Hence, the claim is true when n = 1. Assume the claim is true 
when n = k for some integer k > 1; that is, assume that we can write 

k(k + 1)(2 k + 1) = 6q 

for some integer q. We want to show that the claim is still true when n = k + 1; that is, we 
want to show that 


( k + l){k + 2)[2 (k + 1) + 1] — (k + 1 )(fc + 2) (2k + 3) — 6 Q 
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for some integer Q. Using the inductive hypothesis, we find 

(k + l)(k + 2)(2fc + 3) = (k + l)(2/c 2 + 7k + 6) 

= (k + l)[(2fc 2 + k) + (6k + 6)] 

= (k + l)[k(2k + 1) + 6(fc + 1)] 

= k{k + 1)(2 k + 1) + 6 (k + l) 2 
= 6 q + 6 (k + l) 2 

= 6[q + (k + l) 2 ], 

where q + (k + l) 2 is clearly an integer. This completes the induction. A 

Hands-On Exercise 3.5.1 Prove that n 2 + 3n + 2 is even for all integers n > 1. 
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Hence, it follows from the inductive hypothesis that 


fc+i 

E 


i=l 


1 



1 / o 1 1 

(k + l) 2 -^k + Jk + 


1) 


2 ' 


The proof would be complete if we could show that 

o 1 , 1 <o 1 

k (k + I) 2 " k+ 1 ‘ 

There is no guarantee that this idea will work, but this should be the first thing we try. 
After rearrangement, the inequality becomes 

1 1 ^ 1 
k + 1 + {k + l) 2 “ k’ 

which is equivalent to ^+i) 2 < Cross-multiplication yields 

k(k + 2) < (fc + 1) 2 . 


Since 

k{k + 2) = fc 2 + 2fc, and (fc + l) 2 = fc 2 + 2fc + 1, 
it is clear that what we want to prove is indeed true. <0> 

Polish It Up! Next, we rearrange the argument to make it read more smoothly. Essentially all 
we need is to run the argument backward. To improve the flow of the argument, we can prove 
a separate result on the side before we return to the main argument. $> 


Proof 1: Proceed by induction on n. When n = 1, the left-hand side becomes 1, and so does 
the right-hand side; thus, the inequality holds. Assume it holds when n = k for some integer 
k > 1: 


E 


i 


< 2 - 


1 

k' 


We want to show that it also holds when n = k + 1: 


fc+i 

E 




i 


< 2 - 


1 

k+1 


To finish the proof, we need to derive an inequality. Notice that 

k(k + 2) = k 2 + 2k < k 2 + 2k + 1 = (fc + l) 2 . 

Hence, after dividing both sides by k{k + l) 2 , we obtain 

k + 2 ^ 1 

(k + 1) 2 < k' 

This leads to 

1 1 (fc + l) + l k + 2 1 

k + 1 + (k + 1) 2 ~ (k + 1) 2 ~ (k + 1) 2 < k' 

which is equivalent to 

1 11 

~k + (k + l) 2 < ~ k + 1' 


(3.2) 
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We now return to our original problem. It follows from the inductive hypothesis and (3.2) 


that 


fc+i 


E^ = E^ 


i 


< 2 - - 
< 2 - 


(fc + l ) 2 

1 


k (k + l ) 2 

1 


k + 1 


Therefore, the inequality still holds when n = k + 1, which completes the induction. A 

Remark. The key step in the proof is to establish (3.2), which can be done by means of 
contradiction. <£> 

Proof 2: Proceed by induction on n. When n = 1, the left-hand side becomes 1, and so does 
the right-hand side; thus, the inequality holds. Assume it holds when n = k for some integer 
k > 1: 

e4< 2 -w 

' i 2 k 

i= 1 

We want to show that it also holds when n = k + 1: 

k+l 


1=1 


fc + 1 


To finish the proof, we need the following inequality. We claim that 


1 


1 


< 


1 


k (k + 1) 2 k + l' 


(3.3) 


Suppose, on the contrary, that 


> -- 


k (k + l ) 2 ~ k + l' 

Clear the denominators by multiplying k(k + l) 2 to both sides of the inequality. We find 

— (k + l) 2 + k > —k(k + 1), 

or equivalently, 

— k 2 — k — 1 > — k 2 — k, 

which is the same as saying —1 > 0. This contradiction proves that (3.3) must be true. 

We now return to our original problem. It follows from the inductive hypothesis and (3.3) 
that 


k+l i / fc i N 

£3 = £3 


1 


2=1 


V2=l 

1 


< 2 — — 

< 2 — 


(fc + l ) 2 

1 


fc (fc + 1) 2 

1 


fc + 1 ' 


Therefore, the inequality still holds when n = k + 1, which completes the induction. 
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Hands-On Exercise 3.5.2 Show that n < 2" for all integers n > 1. 


A 

We do not have to start with n = 1 in the basis step. We can start with any integer ?x 0 . 

Generalization. To show that P(n) is true for all integers n > no, follow these steps: 

1. Verify that P(no) is true. 

2. Assume that P(k) is true for some integer k > no . 

3. Show that P(k + 1) is also true. 

The major difference is in the basis step: we need to verify that P(no) is true. In addition, in 
the inductive hypothesis, we need to stress that k > no . 

Example 3.5.3 Use mathematical induction to show that 

n 

^4 i = -(4"+ 1 - 1) 

i — 0 

for all integers > 0. 

Solution: Proceed by induction on n. When n = 0, the left-hand side reduces to ^° =0 4* = 
4° = 1, and the right-hand side becomes |(4 1 — 1) = | • 3 = 1. Hence, the formula holds when 
n = 0. Assume it holds when n = k for some integer k > 0; that is, assume 

E 4< = ^(4 fe+1 -i)- 

;=o 

We want to show that it also holds when n = k + 1; that is, 

fc+i 

E 4i = ^(4 fc+2 -i)- 
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Using the inductive hypothesis, we find 

k + 1 / k \ 

J2 4 ,; = ^ 4* + 4 fc+1 

i=0 \i=0 / 

= § (4 fc+1 - 1) +4 fe+1 

= i (4 fc+1 - l + 3-4 fc+1 ) 

= l (4 • 4 fc+1 - 1) 

= | (4 fc+2 - 1), 

which is what we want to prove, thereby completing the induction. 

Hands-On Exercise 3.5.3 


Prove that, for any integer n > 0, 


2 4 

3 + 9 


= 3 


-a) 


n+ 1 


▲ 


A 


Example 3.5.4 Use mathematical induction to show that 


n" > 2 n 


for all integers n > 2. 

Solution: Proceed by induction on n. When n = 2, the inequality becomes 2 2 > 2 2 , which is 
obviously true. Assume it holds when n = k for some integer k > 2: 

k k > 2 k . 

We want to show that it still holds when n = k + 1: 

(k + l) fc+1 > 2 k+1 . 
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Since fc > 2, it follows from the inductive hypothesis that 

(k + l) fc+1 > k k+1 = k ■ k k > 2 • 2 k = 2 k+1 . 

Therefore, the inequality still holds when n = k + 1. This completes the induction. ▲ 

Summary and Review 

• We can use induction to prove a general statement involving an integer n. 

• The statement can be an identity, an inequality, or a claim about the property of an 
expression involving n. 

• An induction proof need not start with n = 1. 

• If we want to prove that a statement is true for all integers n > ?i 0 , we have to verify the 
statement for n = no in the basis step. 

• In addition, we need to assume that k > no in the inductive hypothesis. 

Exercises 3.5 

1. Use induction to prove that n(n + l)(n + 2) is a multiple of 3 for all integers n > 1. 

2. Use induction to show that n 3 + 5 n is a multiple of 6 for any nonnegative integer n. 

3. Use induction to prove that 

2 + ( 1 -\ — -j=. H — -j=. + • • • H — ) > 2 \J n -\ 1 
V V2 V3 Vn) 

for all integers n > 1. 

4. Use induction to prove that 

2("l+^ + - l - + --- + ^ <3 ^ 

\ 8 27 n 3 J n 2 

for all integers n > 1. 

5. Use induction to prove that 

2 a(r” +1 - 1) 

a + ar + ar ^ H b ar n = — — - 

r — 1 

for all nonnegative integers n, where a and r are real numbers with r/ 1. 

6. Use induction to prove that, for any integer n > 2, 

n 

6 i(i + 2) = 2n 3 + 9n 2 + 7n — 18. 

i = 2 

7. Use induction to prove that, for any integer n > 0, 



8. Use induction to show that n! > 2" for all integers n > 4. 

9. Use induction to prove that n 2 > 4n + 1 for all integers n > 5. 

10. Prove that 2n + 1 < 2” for all integers n > 3. 
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11. Define 

o 1 , 2 , 3 , n 

n ~ 2! + 3! + 4! + ‘" + ( n + l)!' 

(a) Evaluate S n for n = 1, 2, 3, 4, 5. 

(b) Propose a simple formula for S n . 

(c) Use induction to prove your conjecture for all integers n > 1. 


12 . 


Define T n = 

i = o 


1 

(2i + l)(2i + 3)' 


(a) Evaluate T n for n = 0, 1, 2, 3, 4. 

(b) Propose a simple formula for T n . 

(c) Use induction to prove your conjecture for all integers n > 0. 


3.6 Mathematical Induction: The Strong Form 

You may have heard of Fibonacci numbers. They occur frequently in mathematics and life 
sciences. They have even been applied to study the stock market! Fibonacci numbers form a 
sequence every term of which, except the first two, is the sum of the previous two numbers. 
Mathematically, if we denote the nth Fibonacci number F n , then 

F n = F n _ i + F n _ 2 - (3-4) 

This is called the recurrence relation for F n . 

Some students have trouble using (3.4): we are not adding n — 1 and n — 2. The subscripts 
only indicate the locations within the Fibonacci sequence. Hence, F\ means the first Fibonacci 
number, F% the second Fibonacci number, and so forth. Compare this to dropping ten numbers 
into ten boxes, and each box is labeled with the numbers 1 through 10. Let us use a* to denote 
the value in the *th box. When we say < 27 , we do not mean the number 7. Instead, we mean 
the number stored in Box 7. Expressed in words, the recurrence relation (3.4) tells us that the 
nth Fibonacci number is the sum of the (n — l)th and the (n — 2)th Fibonacci numbers. This is 
easy to remember: we add the last two Fibonacci numbers to get the next Fibonacci number. 

The recurrence relation implies that we need to start with two initial values. We often start 
with Fq = 0 (image Fo as the zeroth Fibonacci number, the number stored in Box 0) and F\ = 1. 
We combine the recurrence relation for F n and its initial values together in one definition: 

Fo = 0, Fi = 1, F n = F n _i + F„_ 2 , for n > 2. 

We have to specify that the recurrence relation is valid only when n > 2, because this is the 
smallest value of n for which we can use the recurrence relation. What happens if you want to 
find F\ using this formula? You will get Fi = Fq + F_ 1 , but F_ 1 is undefined! 

The sum of the zeroth and the first Fibonacci numbers give us the second Fibonacci number: 


Continuing in this fashion, we find 


F2 = Fi + F 0 

= 1 + 0 = 1 . 


we find 

F3 = F2 + F\ 

= 1 + 1 = 

2 , 

F4 = F 3 + F2 

= 2 + 1 = 

3, 

F 5 = F4 + F 3 

= 3 + 2 = 

5, 

Fq = Fq + F 4 

= 5 + 3 = 

8 , 


Following this pattern, what are the values of Fi and F 3 ? 
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Fibonacci numbers enjoy many interesting properties, and there are numerous results con- 
cerning Fibonacci numbers. As a starter, consider the property 

F n < 2”, n > 1. 


How would we prove it by induction? 

Since we want to prove that the inequality holds for all n > 1, we should check the case of 
n = 1 in the basis step. When n = 1, we have Fj = 1 which is, of course, less than 2 1 = 2. 
In the inductive hypothesis, we assume that the inequality holds when n = k for some integer 
k > 1; that is, we assume 

F k < 2 k 

for some integer k > 1. Next, we want to prove that the inequality still holds when n = k + 1. 
So we need to prove that 

F k+ 1 <2 fc+1 . 

To make use of the inductive hypothesis, we need to apply the recurrence relation of Fibonacci 
numbers. It tells us that F k+ 1 is the sum of the previous two Fibonacci numbers; that is, 

F k+ 1 = F k + F fc _ i. 

The only thing we know from the inductive hypothesis is F k < 2 k . 
tell us much about F k + 1 . 

A remedy is to assume in the inductive hypothesis that the 
n = k — 1; that is, we also assume that 

F k -\ < 2 k ~\ 

Therefore, unlike all the problems we have seen thus far, the inductive step in this problem relies 
on the last two n - values instead of just one. In terms of dominoes, imagine they are so heavy 
that we need the combined weight of two dominoes to knock down the next. Then 

F k+ 1 = F k + F fc _ i < 2 k + 2 k ~ 1 = 2 fc_1 (2 + 1) < 2 fc " 1 • 2 2 = 2 k+ \ 

which will complete the induction. This modified induction is known as the strong form of 
mathematical induction. In contrast, we call the ordinary mathematical induction the weak 
form of induction. 

The proof still has a minor glitch! To be able to use the inductive hypothesis in the recurrence 
relation 

F k+ 1 = F k + F fc _ i, 

both subscripts k and k — 1 must be at least 1, because the statement claims that F n < 2 n for 
all n > 1. This means we need k > 2. Consequently, in the basis step, we have to assume the 
inequality holds for at least the first two values of n. 

In terms of the domino effect, the chain reaction of the falling dominoes starts at k = 2. We 
have to make sure that the first two dominoes will fall, so that their combined weight will knock 
down the third domino. Then the combined weight of the second and the third dominoes will 
knock over the fourth domino. The chain reaction will carry on indefinitely. 

Symbolically, the ordinary mathematical induction relies on the implication P(k) => P(k+ 1). 
Sometimes, P(k) alone is not enough to prove P(k + 1). In the case of proving F n < 2 n , we 
actually use 

[P(k — 1) A P(k)\ =>P(fc + l). 

We need to assume in the inductive hypothesis that the result is true when n = k— 1 and n = k. 

More generally, in the strong form of mathematical induction, we can use as many previous 
cases as we like to prove P(k + 1). 

Strong Form of Mathematical Induction. To show that P(n ) is true for all n > no, follow 

these steps: 


So, as it stands, it does not 
inequality also holds when 
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1. Verify that P(n) is true for some small values of n > no- 

2. Assume that P{n) is true for n = no, no + 1, . . . , k for some integer k > n*. 

3. Show that P(k + 1) is also true. 

The idea behind the inductive step is to show that 

[ P(n 0 ) A P(n 0 + 1) A • • • A P(k - 1) A P(k) ] +> P(fc + 1). 

We may not need to use all of P(no), P(no + 1), . . . , P(fc — 1), P(fc). In fact, we may only need 
the last few of them, for example, P(k — 3), P(k — 2), P(k — 1) and P(k). The number of previous 
cases required to establish P(fc + 1) tells us how many initial cases we have to verify in the basis 
step. We do not know how many we need until the inductive step. For this reason, it is wise to 
start with a draft. 


Example 3.6.1 Show that F n < 2" for all n > 1. 

Remark. We have already worked on the draft in the discussion above. We know that we 
need to verify the first two n-values in the basis step, and to assume that the inequality holds 
for at least two cases. <0> 


Solution: Proceed by induction on n. When n = 1 and n = 2, we find 

F 1 = l<2 = 2 1 , 

F 2 = 1 < 4 = 2 2 . 


Therefore, the inequality holds when n = 1, 2. Assume it holds for n = 1, 2, . . . , k, where k > 2. 
In particular, we have 

Pfc < 2 fe , and Pfc-i < 2 fc_1 , 

where k > 2. Then 

F k+1 = F k + P fc _ i < 2 k + 2 fc_1 = 2 fc-1 (2 + 1) < 2 fc_1 • 2 2 = 2 k+1 . 

Hence, the inequality still holds when n = k + 1, which completes the induction. A 

Recurrence relation can be used to define a sequence. For example, if the sequence {a n }“ =1 
is defined recursively by 

a n = 3a„_i — 2 for n > 2, 

with ai = 4, then 


ct 2 — 3 ci\ — 2 — 3*4 — 2 — 10, 
ci3 = 3^2 — 2 = 3* 10 — 2 = 28. 


Identity involving such sequences can often be proved by means of induction. 

Example 3.6.2 The sequence {bn}^^ is defined as 

bi =5, b 2 = 13, b n = 56„_i - 6 6„_ 2 for n > 3. 

Prove that b n = 2 n + 3" for all n > 1 . 

Solution: Proceed by induction on n. When n = 1, the proposed formula for b n says b\ = 
2 + 3 = 5, which agrees with the initial value 61 = 5. When n = 2, the proposed formula claims 
b 2 = 4 + 9 = 13, which again agrees with the definition b 2 = 13. Assume the formula is valid for 
n = 1,2, ... ,k for some integer k >2. In particular, assume 

b k = 2 k + 3 fc , and 6 fc _ 1 = 2 k ~ l + S*" 1 . 


We could use more 
than one previous 
case, say, n = k, 
k — 1 , k — 2, . . . , to 
establish the next 
case n = k + 1. 
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We want to show that the formula still works when n = k + 1. In other words, we want to show 
that 

b k+ 1 = 2 fc+1 + 3 fc+1 . 

Using the recurrence relation and the inductive hypothesis, we find 

b k + i = 5 b k - 66 fe _! 

= 5(2 fe + 3 fc ) — 6(2 fc_1 + 3 fc_1 ) 

= 5 • 2 fc + 5 • 3 fe - 6 • 2 fc_1 - 6 • 3 fc_1 

= 5 • 2 fc + 5 • 3 fc — 2 • 3 • 2 fc_1 — 2 • 3 • 3 fc_1 

= 5 • 2 fc + 5 • 3 fc - 3 • 2 fc - 2 • 3 fc 

= 2 • 2 fc + 3 • 3 fe 

c^k~ 1~ 1 | 

which is what we want to establish. This completes the induction, and hence, the claim that 
b n = 2 n + 3 n . A 

Hands-On Exercise 3.6.1 The sequence is defined as 

Ci = 7, b '2 = 29, c n = 5b n -i — d>b n -2 for n > 3. 

Prove that c n = 5 • 3™ — 4 • 2" for all integers n > 1. 


A 

Example 3.6.3 Show that all integers n > 24 can be expressed as Ax + 9 y for some integers 
x,y > 0. 


Definition. The expression Ax + 9 y is called a linear combination of 4 and 9, and x and y 
are called the coefficients of the linear combination. <^> 
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Remark. We want to prove that any sufficiently large integer n can be written as a linear 
combination of 4 and 9 with nonnegative coefficients. This problem is called the postage stamp 
problem for the obvious reason: can we use only 4-cent and 9-cent stamps to obtain an n-cent 
postage for all integers n > 24? Not too surprisingly, it is also called the money changing 
problem (imagine replacing stamps with coins). <C> 

Remark. The spirit behind mathematical induction (both weak and strong forms) is making 
use of what we know about a smaller size problem. In the weak form, we use the result from 
n = k to establish the result for n = k + 1. In the strong form, we use some of the results from 
n = k, k — 1, k — 2, . . . to establish the result for n = k + 1. <0> 

Discussion. Let us first look at the inductive step, in which we want to show that we can write 
k + 1 as a linear combination of 4 and 9. The key step of any induction proof is to relate the 
case of n = k + 1 to a problem with a smaller size (hence, with a smaller value in n). 

Imagine you want to send a letter that requires a (k + l)-cent postage, and you can use only 
4-cent and 9-cent stamps. You could first put down a 4-cent stamp. Then you still need to come 
up with the remaining postage of (k + 1) — 4 = k — 3 cents. If you could use 4-cent and 9-cent 
stamps to make up the remaining ( k — 3)-cent postage, the problem is solved. Therefore, in the 
inductive hypothesis, we need to assume that it can be done when n = k — 3. 

For the whole argument to work, k — 3 has to be within the range of the n-values that we 
consider. So we need k — 3 > 24; that is, we want k > 27. Consequently, we have to verify the 
claim for n = 24, 25, 26, 27 in the basis step. <£> 

Solution: Proceed by induction on n. We find 


24 = 

4- 6 + 9-0, 

25 = 

4 • 4 + 9 • 1, 

26 = 

4- 2 + 9 -2, 

27 = 

4- 0 + 9 -3. 


Hence, the claim is true when n = 24, 25, 26, 27. Assume it is true when n = 24, 25, . . . , k for 
some integer k > 27. In particular, since k — 3 > 24, this assumption assures that 

k — 3 = Ax + 9y 

for some nonnegative integers x and y. It follows that 

k + 1 = 4 + (k - 3) 

= 4 + Ax + 9y 
= 4(1 + x) + 9y, 

where 1+x and y are nonnegative integers. This shows that the claim is still true when n = /c+1, 
thereby completing the induction. A 

Hands-On Exercise 3.6.2 Show that all integers n > 2 can be expressed as 2x + 3 y for some 
nonnegative integers x and y. 
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A 


Summary and Review 

• If, in the inductive step, we need to use more than one previous instance of the statement 
that we are proving, we may use the strong form of the induction. 

• In such an event, we have to modify the inductive hypothesis to include more cases in the 
assumption. 

• We also need to verify more cases in the basis step. 

• Finally, we need to rewrite the whole proof to make it coherent. 

Exercises 3.6 

1. Use mathematical induction to prove the identity 

Fl + Fl + Fl + • • • + Fl = F n F n+1 

for any integer n > 1. 

2. Use induction to prove the following identity for all integers n > 1: 

Fi + F3 + F 5 + ■ ■ ■ + F 2n -i = F 2n . 

3. Use induction to prove that 

F\ F 2 F 3 F n _ 2 _ i 1_ 

F 2 F 3 + F 3 F 4 + F 4 F 5 + " ' + F n _ 4 F n ~ F n 

for all integers n > 3. 

4. Use induction to prove that any integer n > 8 can be written as a linear combination of 3 
and 5 with nonnegative coefficients. 

5. A football team may score a field goal for 3 points or 1 a touchdown (with conversion) for 
7 points. Prove that, for any integer n > 12, it is possible for a football team to score n 
points with held goals and touchdowns. 

6. An island country only issues 1-cent, 5-cent and 9-cent coins. Due to shortage in copper, 
all 1-cent coins were recalled. Prove that, using just 5-cent and 9-cent coins, one can pay 
an n- cent purchase for any n > 32. 

1 Although it is possible for a team to score 2 points for a safety or 8 points for a touchdown with a two-point 
conversion, we would not consider these possibilities in this simplified version of a real football game. 
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7. The sequence {bn}™^ is defined recursively by 

b n = 3&„_i — 2 for n > 2, 

with bi = 4. Use induction to prove that b n = 3" + 1 for all n > 1. 

8. The sequence { c„ }£?_! is defined recursively as 

ci = 3, C2 = —9, c n = 7c n -\ — 10c„_2, for n > 3. 
Use induction to show that c n = 4 • 2" — 5" for all integers n > 1. 

9. The sequence is defined recursively as 

di = 2, ^2 = 56, d n = d n - 1 + 6d n -2, for n > 3. 

Use induction to show that d n = 5(— 2) n +4-3” for all integers n > 1. 

10. The sequence {an}^! is defined recursively as 

ai = 2, 02 = 4, a n = 2a„_i + 3a n _2, for n > 3. 

Use induction to show that a n > (|)" for any integer n > 4. 
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Sets 


4.1 An Introduction 

A set is a collection of objects. The objects in a set are called its elements or members . The 
elements in a set can be any types of objects, including sets! The members of a set do not even 
have to be of the same type. For example, although it may not have any meaningful application, 
a set can consist of numbers and names. 

We usually use capital letters such as A , B , C, S, and T to represent sets, and denote 
their generic elements by their corresponding lowercase letters a, b, c, s, and t, respectively. To 
indicate that b is an element of the set B , we adopt the notation b £ B, which means “6 belongs 
to B" or “6 is an element of B.” We also write B 9 6, and say “B contains b (as an element).” 
We designate these notations for some special sets of numbers: 

R. = the set of real numbers, 

Q = the set of rational numbers, 

Z = the set of integers, 

N = the set of natural numbers (positive integers). 

All these are infinite sets, because they all contain infinitely many elements. In contrast, finite 
sets contain finitely many elements. 

We can use the roster method to describe a set if it has only a small number of elements. 
We list all its elements explicitly, as in 

A = the set of natural numbers not exceeding 7 = {1, 2, 3, 4, 5, 6, 7}. 

For sets with more elements, show the first few entries to display a pattern, and use an ellipsis 
to indicate “and so on.” For example, 


{1,2,3,. ..,20} 

represents the set of the first 20 positive integers. The repeating pattern can be extended 
indefinitely, as in 

N = {1,2,3,...} 

Z = {...,-2, -1,0, 1,2,...} 

There are three kinds of integers: positive, negative, and the signless integer zero. In regards to 
parity, an integer is either even or odd. An integer is even if and only if it is divisible by two. 
Therefore, the set of even integers can be described as {. . . , —4, —2, 0, 2,4,...}. 

We can use a set-builder notation to describe a set. For example, the set of natural 
numbers is defined as 


N = {x G Z | x > 0}. 
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Here, the vertical bar | is read as “such that” or “for which.” Hence, the right-hand side of the 
equation is pronounced as “the set of x belonging to the set of integers such that x > 0,” or 
simply “the set of integers x such that x > 0.” In general, this descriptive method appears in 
the format 

{membership | properties}. 

The notation | means “such that” or “for which” only when it is used in the set notation. It may 
mean something else in a different context. Therefore, do not write “let a; be a real number | 
x 2 > 3” if you want to say “ let a; be a real number such that x 2 > 3.” It is considered improper 
to use a mathematical notation as an abbreviation. 

Example 4.1.1 Write these two sets 

{x € Z \ x 2 < 1} and {x € N | x 2 < 1} 
by listing their elements explicitly. 

Solution: The first set has three elements, and equals {—1, 0, 1}. The second set is a singleton 
set; it is equal to {1}. ▲ 

Hands-On Exercise 4.1.1 Use the roster method to describe the sets {x € Z | x 2 < 20} and 
{x e N | x 2 < 20}. 


A 


Hands-On Exercise 4.1.2 Use the roster method to describe the set 
{x £ N | x < 20 and x = n 2 for some integer n}. 


A 

There is a slightly different format for the set-builder notation. Before the vertical bar, we 
describe the form the elements assume, and after the vertical bar, we indicate from where we 
are going to pick these elements: 


{ pattern | membership } . 

Here the vertical bar | means “where.” For example, 

{x 2 | x € Zj 

is the set of x 2 where x € Z. It represents the set of squares: {0, 1, 4, 9, 16, 25, . . .}. 


Example 4.1.2 The set 


{2n \ n e Zj 


A 


describes the set of even numbers. We can also write the set as 2Z. 
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Hands-On Exercise 4.1.3 


Describe the set {2?z + 1 | n G Z} with the roster method. 


A 


Hands-On Exercise 4.1.4 Use the roster method to describe the set {3 n \ n G Z}. 


A 

An interval is a set of real numbers, all of which lie between two real numbers. Should 
the endpoints be included or excluded depends on whether the interval is open , closed, or 
half- open. We adopt the following interval notation to describe them: 

(a, b) = {x € R. | a < x < b}, 

[a, 6] = {x G K | a < x < b}, 

[a,b) = {x G R \ a < x < b} , 

(a, b] = {x G R \ a < x < b} . 

It is understood that a must be less than or equal to b. Hence, the notation (5, 3) does not make 
much sense. How about [3,3]? Is it a legitimate notation? 

An interval contains not just integers, but all the numbers between the two endpoints. By 
numbers, we mean whole numbers and decimal numbers. For instance, (1, 5) {2, 3, 4} because 

the interval (1,5) also includes decimal numbers such at 1.276, \/2, and i r. 

We can use ±oo in the interval notation: 

(a, oo) = {x € R. | a < x}, 

(—oo, a) = {x € R | x < a}. 

However, we cannot write (a,oo] or [— oo,a), because ±oo are not numbers. It is nonsense to 
say x < oo or — oo < x. For the same reason, we can write [a,oo) and (— oo,a], but not [a,oo] 
or [—oo, a]. 


Example 4.1.3 Write the intervals (2,3), [2,3], and (2,3] in the descriptive form. 


Solution: According to the definition of 

an interval, we find 

(2,3) = 

{iel 

CO 

V 

V 

CM 

[2,3] = 

[jeR 

2 < x < 3}, 

(2,3] = 

[itR 

CO 

VI 

V 

CM 

What would you say about [2,3)? 



Example 4.1.4 Write these sets 




{x € R | — 2 < x < 5} and {x € R | x 2 < 1} 

in the interval form. 


Solution: The answers are [—2,5) and [—1,1], respectively. The membership of x affects the 
answers. If we change the second set to {x G Z | x 2 < 1}, the answer would have been { — 1, 0, 1}. 
Can you explain why {—1, 0, 1} [—1, 1]? A 


The interval (a, b) 
includes decimal 
numbers. 


oo is not a number, 
neither is —oo 
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See Example 2.1.8. 


Example 4.1.5 Be sure you are using the right types of numbers. Compare these two sets 

S = {x € Z \ x 2 < 5}, 

T = {x G R | x 2 < 5}. 

One consists of integers only, while the other contains real numbers. Thus, S = {—2, —1, 0, 1, 2}, 
and T = [ — \/5, y/5 ] . A 

Hands-On Exercise 4.1.5 Which of the following sets 

{x € Z | 1 < x < 7} and {x € R | 1 < x < 7} 
can be represented by the interval notation (1,7)? Explain. 


Hands-On Exercise 4.1.6 Explain why [2, 7] ^ {2, 3, 4, 5, 6, 7}. 


A 


Hands-On Exercise 4.1.7 True or false: (—2,3) = {—1,0, 1,2}? Explain. 


A 


A 


Let S' be a set of numbers; we define 

S + = {a: £ S | x > 0}, 

S~ = {a; € S | x < 0}, 

S* = {a: G S | x ± 0}. 

In plain English, S + is the subset of S containing only those elements that are positive, S~ 
contains only the negative elements of S, and S* contains only the nonzero elements of S. 

Example 4.1.6 It should be obvious that N = Z + . A 

Hands-On Exercise 4.1.8 What is the notation for the set of negative integers? 


A 


Some mathematicians also adopt these notations: 

bS = {bx | x € S}, 
a + bS = {a + bx \ x € S}. 
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Accordingly, we can write the set of even integers as 2Z, and the set of odd integers can be 
represented by 1 + 2Z. 

An empty set is a set that does not contain any element. Both 

{x € K. | x > 0 and x < 0} and {a: € K. | x 2 < 0} 

are examples of empty sets. The second example illustrates a typical application of an empty 
set. It provides a convenient way of declaring that a problem has no solution: we say that the 
solution set is an empty set. We denote an empty set with the notation 0 or { }. For example, 
can you explain why (3, 3) = 0? 

Hands-On Exercise 4.1.9 What does the notation [7, 7] mean? How would you describe the 
sets (7,7), (7,7] and [7,7)? 


A 

Example 4.1.7 Determine which of these statements are true. 

{i e K I (i 2 + 2){x 2 + 3) = 0} = 0, 

{x G Z | (x 2 — 2)(x 2 + 3) = 0} = 0, 

{reR| ( x 2 — 2)(x 2 + 3) = 0} = 0, 

{a; G R | (x 2 — 2)(x 2 + 3) > 0} = 0. 

Solution: The answers are: true, true, false, and false, respectively. A 

Example 4.1.8 When we write 3, 4, 5, . . . , n, we are referring to a list of integers between 3 
and n, inclusive. It is understood that n > 3. Consequently, the set 

{3,4,5, ... ,ra} 


is empty when n = 2. A 

Two sets A and B are said to be equal if they contain the same collection of elements. More 
rigorously, we define 

A = B<=>\/x(x£A<£>x£ B). 

Since the elements of a set can themselves be sets, exercise caution and use proper notation 
when you compare the contents of two sets. 

Example 4.1.9 Explain why {0, {1}} ^ {0,1}. 

Solution: The set {0, {1}} consists of two elements: the integer 0 and the set {1}. The set 
{0, 1} also consists of two elements, both of them integers; namely, 0 and 1. 

You may find the following analogy helpful. Imagine a set being a box. You open a box 
to look at its contents. The box itself can be compared to the curly braces { and }. What it 
holds is exactly what we call the elements of the set it represents. The contents of the two sets 
{0, {1}} and {0, 1} are depicted in the boxes shown in Figure 4.1. 

When you open the first box, you find two items. One of them is the number 0; the other is 
another box that contains the number 1. The second box also contains two items that are both 
numbers. What you find in these two boxes is not the same. Hence, the sets they represent are 
different. A 


A set may contain 
another set as an 
element. 
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Pay attention 
to the details. 


| j4.| is nonnegative. 



Figure 4.1: The two sets {0, {1}} and {0, 1}. 


Hands-On Exercise 4.1.10 


Name some differences between the sets {0, {1}} and {{0}, {1}}. 


A 


Example 4.1.10 True or false: Z ={{... , —3, —2, —1}, 0, {1, 2, 3, . . .}}? 

Solution: The set on the left is Z, and 

Z = {...,-3,-2, -1,0, 1,2,3, 

It is an infinite set. The set on the right consists of only three elements: 

(i) the set 3,— 2,-1}, which is the set of negative integers, 

(ii) the integer 0, and 

(iii) the set {1, 2, 3, . . which is the set of positive integers. 

Hence, they are not equal. Notice that 

Z 7^ {{•••, — 3 , —2, —1}, {0}, {1, 2, 3 , . . .}} 

either, because the set on the right is a set of three sets, while the set on the left is a set of 
integers. One has three elements; the other has infinitely many elements. ▲ 

To reduce confusion, instead of saying a set of sets, we could say a collection of sets or a 
family of sets. For example, 

{{ 1 , 3 , 5 , ...,}, { 2 , 4 , 6 , ... }} 

is a family of two sets, one of which is the set of positive odd integers; the other is the set of 
positive even integers. 

Definition. A set is said to be finite if it has a finite number of elements. The number of 
elements in a finite set A is called its cardinality , and is denoted by \A\. Hence, |A| is always 
nonnegative. If A is an infinite set, some authors would write |A| = oo. <0> 

Example 4.1.11 While it is trivial that | {1, 4, 7, 8} | = 4, and |{0,1}| = 2, it may not be 
obvious that 

|{0,{!}}| =2, 

and 

[{{...,-3,-2,— 1},0, {1,2,3,...}}| =3. 
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What matters is the number of entries in a set, which can be compared to how many items you 
can find when you open a box. Here is another example: 

|{x € K. | x 2 = 9} | = 2 

because the equation x 2 = 9 has two real solutions. What is \{x £ N | x 2 = 9}|? A 

Hands-On Exercise 4.1.11 Determine these cardinalities: 

(a) \{x £ Z | x 2 — 7x — 6 = 0} | 

(b) \{x £ R. | x 2 — x — 12 < 0} | 

(c) \{x £ Z \ x is prime and x is even}| 

Recall that your answers should be nonnegative. A 

Hands-On Exercise 4.1.12 Explain why it is incorrect to say |0| = 0. In fact, it is nonsense |0| is an integer. 
to say |0| =0. Explain. What should be the value of |0|? 


A 


We close this section with an important remark about sets. It follows from the definition 
of equality of sets that we do not count repeated elements as separate elements. For example, 
suppose a small student club has three officers: 

chair: Mary, 

vice chair: John, 
secretary: John; 


and let A represent the set of its officers, and B the set of positions in its executive board, then 
|A| = 2 and \B\ = 3, because 

A = {Mary, John}, 


and 


B = {chair, vice chair, secretary}. 


Example 4.1.12 Find the errors in the following statement: 


|{ 2 , 2 }| = { | 2 1 , | 2 |} = { 2 } = 2 , 


and correct them. 

Solution: This statement contains several errors. The first mistake is assuming that we can 
distribute the “absolute value” symbols | | over the contents of a set: 

I { 2, 2}| ^ { | — 2|, 1 2 1 } . 

After all, the two vertical bars do not mean absolute value in this case. Instead, it means the 
cardinality of the set {—2,2}. Hence, | { — 2, 2} | = 2. 

The second equality { | — 2|, |2|} = {2} is correct. After taking absolute values, both entries 
become 2. However, we do not write {| — 2|,|2|} = {2,2}, because a set should not contain 
repetition. Therefore, it is correct to say { | — 2|, |2|} = {2}. 
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The last equality {2} = 2 is wrong. We cannot compare a set to a number. Imagine the set 
{2} as a box containing only one object, and that object is the number 2. In contrast, 2 on the 
right-hand side is left in the open air without any containment. It is clear that {2} ^ 2. 

The entire statement contains multiple mistakes; some of them are syntactical errors while 
some are conceptual. Nevertheless, we do have |{— 2, 2} | = 2. Although the final answer is 
correct, the argument used to obtain it is not. A 

In some situations, we do want to count repeated elements as separate elements, as in 
S = {1, 2, 2, 2, 3, 3, 4, 4}. We call such a collection a multiset instead of an ordinary set. In this 
case, |Sj =8. 


Summary and Review 

• A set is a collection of objects (without repetitions). 

• To describe a set, either list all its elements explicitly, or use a descriptive method. 

• Intervals are sets of real numbers. 

• The elements in a set can be any type of object, including sets. 

• We can even have a set containing dissimilar elements. In particular, we can mix elements 
and sets inside a set. 

• If a set A is finite, its cardinality |A| is the number of elements it contains. Consequently, 
|A| is always nonnegative. 


Exercises 4.1 

1. Write each of these sets by listing its elements explicitly (that is, using the roster method). 

(a) {n € Z | — 6 < n < 4} (b) {n £ N | -6 < n < 4} 

(c) {x € Q | x 3 — x 2 — 6x = 0} (d) {x € Q | x A — llx 2 + 18 = 0}. 

2. Use the roster method to describe these sets: 

(a) {x € N | x < 20 and a; is a multiple of 3} 

(b) {x £ Z | \x\ < 20 and x is a multiple of 3 or a multiple of 5} 

(c) {x € Z | \x\ < 20 and x is a multiple of 3 and a multiple of 5} 

(d) {x £ N | x < 20 and a; is a multiple of 3 but not a multiple of 5} 

3. Write each of the following sets in the form {n £ Z \ p(n)j with a logical statement p(n) 
describing the property of n. 


(a) {. . . , -3, -2, -1} (b) {. . . , -27, -8,-1, 0, 1, 8, 27, . . .} 

(c) {0, 1, 4, 9, 16, . . .} (d) {. . . , -15, -10, -5, 0, 5, 10, 15, . . .} 

(e) {0, 4, 8, 12, . . .} (f) {. . . , -14, -8, -2, 4, 10, 16, . . .} 

4. Repeat Problem 3, but write the sets in the form {/(n) | n £ 5}, where /(n) is a formula 

that describes the pattern of the elements, and S is an appropriate set of numbers. 

5. Whenever possible, express the sets in Problem 3 in the form S + , S ~ , bS, or a + bS for 
some appropriate set S. 

6. Determine whether the following sets are empty, finite sets, or infinite sets: 


(a) {2n + 1 | n £ N} 

(c) {x € Q | x > 0 and x < 0} 


(b) {a: € R | x 2 < 0} 

(d) {x € N | x < 0 or x > 0} 
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7. Write each of these sets in the interval notation. 


(a) {x G R | — 4 < x < 7} 
(c) {x G R + | — 4 < x < 7} 
(e) {x G R | x < 6} 


(b) {x G 
(d) {x G 
(f) {x G I 


— 4 < x < 7} 
—4 < x} 

| 0 < x < 6} 


8. Is [— 00 , 00 ] a legitimate or correct notation? Explain. 

9. Evaluate the following expressions. 


(a) \{x G Z | — 4 < x < 7} | 
(c) |{ar G N | — 4 < x < 7} | 
(e) |{ 3, —2, 2, 3} | 


(b) \{x G Z | — 4 < x < 7} | 

(d) |{x G R | x 4 — 2x 3 — 35x 2 = 0} | 
(f) |{cc G Q | a; 2 = 3}] 


10. Determine which of the following statements are true, and which are false. 


(a) a G {a} 

(c) 0 G 0 
( e ) { } = 0 


(b) {3,5} = {5,3} 
(d) 0 = {0} 

(f) 0 G {0} 


11. Determine which of the following statements are true, and which are false. 


(a) 2 G (2, 7) 
(c) (^5 ) 2 G(! 


(b) v/2G (1,3) 
(d) -5 G N 


12. Give examples of sets A, B and C such that: 

(a) A G B and B G C, and A ^ C 

(b) A G B and B G C, and A G C 

13. Determine whether the following statements are correct or incorrect syntactically. For 
those that are syntactically correct, determine their truth values; for those that are syn- 
tactically incorrect, suggest ways to fix them. 

(a) (3,7] = 3 < x < 7. 

(b) {a: G R | x 1 < 0} = 0. 

14. Determine whether the following statements are correct or incorrect syntactically. For 
those that are syntactically correct, determine their truth values; for those that are syn- 
tactically incorrect, suggest ways to fix them. 

(a) | G [2,V7). 

(b) There does not exist x such that x G R + and R . 

(c) If (0, 00 ), then x is positive. 


4.2 Subsets and Power Sets 

We usually consider sets containing elements of similar types. The collection of all the objects 
under consideration is called the universal set, and is denoted U. For example, for numbers, 
the universal set is R. 
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Example 4.2.1 Venn diagrams are useful in demonstrating set relationship. Let 

U = set of geometric figures, 

S = set of squares, 

P = set of parallelogram, 

R = set of rhombuses, 

L = set of rectangles, 

C = set of circles. 

Their relationship is displayed in Figure 4.2. A 



Figure 4.2: The relationship among various sets of geometric figures. 


The pictorial representation in Figure 4.2 is called a Venn diagram. We use a rectangle 
to represent the universal set, and circles or ovals to represent the sets inside the universal set. 
The relative positions of these circles and ovals indicate the relationship of the respective sets. 
For example, having R, S , and L inside P means that rhombuses, squares, and rectangles are 
parallelograms. In contrast, circles are incomparable to parallelograms. 

A set A is a subset of another set B, denoted by A C B, if every element of A is also an 
element of B. See Figure 4.3. We also call B a superset of A, and write BAA, which is 
similar to y > x. 



Figure 4.3: The Venn diagram for A C B. 


Example 4.2.2 It is clear that N C Z and ZCI. We can nest these two relationships into 
one, and write N C Z C R. More generally, we have 

NCZCQCI. 

Compare this to x < y < z < w. We shall discover many similarities between C and <. A 






4.2 Subsets and Power Sets 


91 


Example 4.2.3 It is obvious that 

{1,2,7} C {1,2, 3, 6, 7, 9} 

because all three elements 1, 2, and 7 from the set on the left also appear as elements in the set 
on the right. Meanwhile, 

{1,2,7} £{1,2,3,6,8,9} 

because 7 belongs to the first set but not the second. A 

Example 4.2.4 The following statements are true: 

(a) {1,2,3} CN. 

(b) {x G R | x 2 = 1} C Z. 

Be sure you can explain clearly why these subset relationships hold. A 


Hands-On Exercise 4.2.1 Are these statements true or false? 

(a) {-1, 2} £ N, and {-1, 2} C Z. 

(b) {x G Z | x 2 < 1} C R. A 

Example 4.2.5 Do not assume that if A <£. B then we must have B C A. For instance, if 
A = {1, 5, 7} and B = {3, 8}, then A <£. B\ but we also have B (£ A. A 


The last example demonstrates that A J- B is more complicated than just changing the 
subset notation like we do with inequalities. We need a more precise definition of the subset 
relationship: 

ACB^\/xeU(xeA^xeB). 

It follows that 

A<£ B <&3x £U (x £ AAx £ B). 

Hence, to show that A is not a subset of B , we need to find an element x that belongs to A but 
not B. There are three possibilities; their Venn diagrams are depicted in Figure 4.4. 


The definition of 
ACB. 


u 


(3 

u) 




Figure 4.4: Three cases of A $7 B. 

Example 4.2.6 We have [3,6] C [2,7), and [3,6] [4,7). We also have (3,4) C [3,4]. A 

Hands-On Exercise 4.2.2 True or false: [3,4) C (3,4)? Explain. 


A 
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A way to prove 
A = B. 


Compare this to 
(x <y)A(y < z ) 
=> x < z. 


With the notion of universal set, we can now refine the definition for set equality: 

A = B<=>\/x£lf(x£A<^x£ B). 

Logically, x £ A 4$ x £ B is equivalent to 

(a: G A =>• x £ B) A (x £ B => x £ A). 

Therefore, we can also define the equality of sets via subset relationship: 

A = B <=> (A C B) A (B C A), 

which can be compared to 

x = y (x < y) A (y < x) 

for real numbers x and y. 

This new definition of set equality suggests that in order to prove that A — B, we could use 
this two-step argument: 

1. Show that AC B. 

2. Show that B C A. 

This technique is useful when it is impossible or impractical to list the elements of A and B for 
comparison. This is particularly true when A and B are defined abstractly. We will apply this 
technique in the coming sections. 

The two relationship C and < share many common properties. The transitive property is 
another example. 

Theorem 4.2.1 Let A, B, and C he sets. If AC B and B C C, then ACC. 

Discussion. The theorem statement is in the form of an implication. To prove p => q, we start 
with the assumption p, and use it to show that q must also be true. In this case, these two steps 
become 

(i) Assume that AC B and B C C. 

(ii) Show that ACC. 

How can we prove that A C C ? We know that ACC means 

\/x£lf(x£A=>x£ C). 

So we have to start with x £ A, and attempt to show that x £ C as well. How can we show 
that x £ Cl We need to use the assumption AC B and B C C. <0> 

Proof: Assume A C B and B C C. Let x £ A. Since A C B, we also have x £ B. Likewise, 
B C C implies that x £ C. Since every element a; in A is also an element of C, we conclude 
that ACC. ■ 

The proof relies on the definition of the subset relationship. Many proofs in mathematics 
are rather simple if you know the underlying definitions. 

Example 4.2.7 Prove that x £ A <t=> {:r} C A, for any element x £ U. 

Discussion. We call p q a biconditional statement because it consists of two implications 
p =$■ q and p -<= q. Hence, we need to prove it in two steps: 

1. Show that p => q. 

2. Show that q => p. 
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We call these two implications the necessity and sufficiency of the biconditional statement, 
and denote them (=>) and (-4=), respectively. In this problem, 

• (=>) means “x G A => {x} C A”. 

• (-4=) means “{x} C A => x £ A”. 

This is how the proof may look: 


(=0 

Assume x G A. 

Therefore {x} C A. 

M 

Assume {x} C A. 

Therefore x G A. 


We now proceed to finish the proof. <C> 

Solution: (=4>) Assume x G A. The set {x} contains only one element x, which is also an 
element of A. Thus, every element of {x} is also an element of A. By definition, {x} C A. 

(4=) Assume {a:} C A. The definition of the subset relationship asserts that every element of 
{x} is also an element of A. In particular, x is an element of {x}, so it is also an element of A. 
Thus, x G A. A 

Definition. The set A is a proper subset of B, denoted A C B or A C B, if A is a subset of 
B , and A ^ B. Symbolically, A C B 4=> (A C B) A (A B). Equivalently, 

A c B (A C B) A 3x G U (x G B A x £ A). 

See the Venn diagram in Figure 4.5. <0> 



Figure 4.5: The definition of a proper subset. 

Example 4.2.8 It is clear that [0, 5] C R. We also have 

NcZcQcR. 

Note the similarities between C and <. Compare the last expression to 

x < y < z < w. 

Here is another similarity between C and <. For numbers, x < y and y < z together imply that 
x < z. We call this the transitive property. In a similar fashion, for sets, if A C B and B C C, 
then AcC; see Theorem 4.2.1. A 

Hands-On Exercise 4.2.3 True or false: (3,4) C [3,4]? How about (3,4) C (3,4]? 


A 
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An implication is 
always true if its 
hypothesis is false. 


Every element in 
p(A) is a stand-alone 
set; more precisely, it 
is a subset of A. 


Theorem 4.2.2 For any set A, we have 0C4 and AC A. In particular, 0 C 0. 

Proof: Since every element of A also appears in A, it follows immediately that A C A. To 
show that 0 C A, we need to verify the implication 

x G 0 => x G A 

for any arbitrary x G U. Since 0 is empty, x € 0 is always false; hence, the implication is always 
true. Consequently, 0 C A for any set A. In particular, when A = 0, we obtain 0 C 0. ■ 

Example 4.2.9 Determine the truth values of these expressions. 

(a) 0 G 0 (b) 1 C {1} (c) 0 G {0} 

Solution: (a) By definition, an empty set contains no element. Consequently, the statement 
0 G 0 is false. 

(b) A subset relation only exists between two sets. To the left of the symbol C, we have only a 
number, which is not a set. Hence, the statement is false. In fact, this expression is syntactically 
incorrect. 

(c) The set {0} contains one element, which happens to be an empty set. Compare this to an 

empty box inside another box. The outer box is described by the pair of set brackets {•■•}, 
and the (empty) box inside is 0. It follows that 0 G {0} is a true statement. ▲ 

Hands-On Exercise 4.2.4 Determine the truth values of these expressions. 

(a) 0 C {0} (b) {1} C {1,{1, 2}} (c) {1} C {{!},{!, 2}} 


A 

Definition. The set of all subsets of A is called the power set of A, denoted p(A). <C> 

Since a power set itself is a set, we need to use a pair of left and right curly braces (set 
brackets) to enclose all its elements. Its elements are themselves sets, each of which requires its 
own pair of left and right curly braces. Consequently, we need at least two levels of set brackets 
to describe a power set. 

Example 4.2.10 Let A = {1,2} and B = {1}. The subsets of A are 0, {1}, {2} and {1,2}. 
Therefore, 

p(A) = {0,{l},{2},{l,2}}. 

In a similar manner, we find 

p(S) = {0,{l}}. 

We can write directly 

P ({1,2}) = {0,{1},{2},{1,2}}, and p({l}) = {0, {1}} 
without introducing letters to represent the sets involved. ▲ 
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Hands-On Exercise 4.2.5 Let us evaluate p({ 1, 2, 3, 4}). To ensure that no subset is missed, Since 0 C 
we list these subsets according to their sizes. Since 0 is the subset of any set, 0 is always an set A, we 
element in the power set. This is the subset of size 0. Next, list the singleton subsets (subsets 0 e p(A). 
with only one element). Then the doubleton subsets, and so forth. Complete the following table. 


size 

subsets 

0 

0 

1 


2 

{1,2}, {1,3},... 

3 

{1,2,3},... 

4 



Since A C A for any set A, the power set p(A) always contains A itself. As a result, the last 
subset in the list should be A itself. 

We are now ready to put them together to form the power set. All you need is to put all the 
subsets inside a pair of bigger curly braces (a power set is itself a set; hence, it needs a pair of 
curly braces in its description). Put your final answer in the space below. 


Check to make sure that the left and right braces match perfectly. A 

Example 4.2.11 Since A is a subset of A, it belongs to p(A). Nonetheless, it is improper to 
say A C p(A). Can you explain why? What should be the correct notation? 

Solution: The power set p(A) is the collection of all the subsets of A. Thus, the elements in 
p(A) are subsets of A. One of these subsets is the set A itself. Hence, A itself appears as an 
element in p(A), and we write A £ p(A) to describe this membership. 

This is different from saying that A C p(A). In order to have the subset relationship 
A C p(A), every element in A must also appear as an element in p(A). The elements of p(A) 
are sets (they are subsets of A, and subsets are sets). An element of A is not the same as a 
subset of A. Therefore, although A C p(A) is syntactically correct, its truth value is false. A 


Hands-On Exercise 4.2.6 Explain the difference between 0 and {0}. How many elements are 
there in 0 and {0}? Is it true that p(0) = {0}? 


A 

Theorem 4.2.3 If A is an n-element set, then p(A) has 2" elements. In other words, an 
n-element set has 2" distinct subsets. 

Proof: How many subsets of A can we construct? To form a subset, we go through each of 
the n elements and ask ourselves if we want to include this particular element or not. Since 
there are two choices (yes or no) for each of the n elements in A, we have found 2 • 2 • • • • 2 = 2" 


A for any 
always have 


subsets. 


n times 
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Hands-On Exercise 4.2.7 How many elements are there in p({a-,/3, 7})? What are they? 


A 

Hands-On Exercise 4.2.8 What is the cardinality of 0? How about p(0)? Describe p(0). 

A 

Is it correct to write |p(A)| = 2^1 ? How about |p(A)| = 2 A ? 

A 

Example 4.2.12 When a set contains sets as elements, its power set could become rather 
complicated. Here are two examples. 

p({W,{ 1}}) = {0, {{«}}, {{1}}, {{a}, {!}}}, 

p({0 ,{l}}) = { 0 ,{ 0 },{{ 1 }},{ 0 ,{ 1 }}}. 

Be sure you understand the notations used in these examples. In particular, examine the number 
of levels of set brackets used in each example. ▲ 

Summary and Review 

• A set S' is a subset of another set T if and only if every element in S can be found in T. 

• In symbols, SCT^\/x£U(x£S=>x€T). 

• Consequently, to show that SCT, we have to start with an arbitrary element x in S, and 
show that x also belongs to T. 

• The definition of subset relationship implies that for any set S, we always have 0 C S and 
SCS. 

• The power set of a set S, denoted p(S), contains all the subsets of S. 

• If |S| = n, then |p(S)| = 2 n . Hence, an n-element set has 2 n subsets. 

• To construct p(S'), list the subsets of S according to their sizes. Be sure to use a pair of 
curly braces for each subset, and enclose all of them within a pair of outer curly braces. 

Exercises 4.2 

1. Determine which of the following statements are true and which are false. 

(a) {1,2,3} C {0,1, 2, 3, 4} (b) {1,2,3} CN 

(c) {1,2} c [1,2] (d) [2,4] C (0,6) 

(e) [2,4) C [2,4] (f) [2,4) C (2,4] 

2. Determine which of the following statements are true and which are false. 


Hands-On Exercise 4.2.9 

Explain. 


(a) a C {a} 
(c) 0 C 0 
(e) 0 C {0} 


(b) {a} C {a, 6} 

(d) 0 C {0} 

(f) {a}Cp({{a},{ 6 }}) 
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3. Explain why Z C Q. In particular, explain how to express an integer as a rational number. 

4. True or false: N C 6N? Explain. 

5. If A C B, B C C, and CCD, is it true that A C D7 What do you call this property? 

6. Determine whether the following statements are true or false: 

(a) The empty set 0 is a subset of {1, 2, 3}. 

(b) If A = {1, 2, 3}, then {1} is a subset of p(A). 

7. Find the power set of the following sets. 

(a) {a,b} (b){4, 7} (c ){x,y,z,wj 

(d) {{a}} (e) {a, {&}} (f) {{*}, {y}} 

8. Evaluate the following sets. 

(a)p({0}) (b )p(p({a,b})) (c)p(p(p(0))) 

9. We have learned that A C A for any set A. Then, should we write A £ p(A ) ordC p(A)? 
Explain. 

10. Prove that X £ p{A) if and only if X C A. 

11. Determine which of the following statements are true, and which are false. Explain! 

(a) {a} £ {a, b, c} (b) {a} C {{a},b, c} (c) {a} G p({{a}, b, c}) 

12. Determine which of the following statements are true, and which are false. Explain! 

(a) {a} C {a, b, c } (b) {a} C {{a, b}, c} (c) {a} C p({{a}, b, c}) 


4.3 Unions and Intersections 


We can form a new set from existing sets by carrying out a set operation. 
Definition. Given two sets A and B , define their intersection to be the set 

AcB = {x£lA\x£Af\x£ B}. 

Loosely speaking, An B contains elements common to both A and B. 


Note the similarity 
between the symbols 
(T and A. 


❖ 


Definition. The union of A and B is defined as 

Aid B = {x £U \ x £ A\/ x £ B}. 

Thus AC B is, as the name suggests, the set combining all the elements from A and B. 


0 


Note the similarity 
between the symbols 
U and V. 




An B 


AC B 
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x € A — B means x 
belongs to A but not 
to B. 


Definition. The set difference A — B, sometimes written as A\B, is defined as 

A — B = {xGU\xgAAx^.B}. 

In words, A — B contains elements that can only be found in A but not in B. Operationally 
speaking, A — B is the set obtained from A by removing the elements that also belong to B. 
Therefore, the set difference A — B is also called the relative complement of B in A. In 
particular, U — A is called the complement of A, and is denoted by A , A' or A c . <C> 




Remark. We would like to remind the readers that it is not uncommon among authors to 
adopt different notations for the same mathematical concept. Likewise, the same notation could 
mean something different in another textbook or even another branch of mathematics. It is 
important to develop the habit of examining the context and making sure that you understand 
the meaning of the notations when you start reading a mathematical exposition. <0> 

Example 4.3.1 Let U = {1, 2, 3, 4, 5}, A = {1, 2, 3}, and B = {3, 4}. Find A n B, A U B, 
A — B, B — A, A, and B. 

Solution: We have 


A<1B = {3}, 

AUB = {1,2, 3, 4}, 

A — B = {1,2}, 

B-A = {4}. 

We also find A = {4, 5}, and B = { 1,2, 5}. 

Hands-On Exercise 4.3.1 Let U = {John, Mary, Dave, Lucy, Peter, Larry}, 

A = {John, Mary, Dave}, and B = {John, Larry, Lucy}. 
Find A n B, A U B, A — B 1 B — A, A, and B. 


A 


A 


Hands-On Exercise 4.3.2 If A C B, what would be A — B1 


A 
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Example 4.3.2 The set of integers can be written as the 

Z = {-l,-2,-3,...}U{0}U{l,2,3,...}. 

Can we replace {0} with 0? Explain. A 

Hands-On Exercise 4.3.3 Explain why the following expressions are syntactically incorrect. 

(a) Z = {—1, —2, —3, . . .} U 0 U {1, 2, 3, . . .}. 

(b) Z = ...,-3,-2,-l U 0 U 1,2,3,... 

(c) Z = . 3, — 2, — 1 + 0 + 1,2,3,... 

(d) Z = lr U0UZ+ 

How would you fix the errors in these expressions? 


A 

Example 4.3.3 For any set A. what are H n 0, H U 0, A — 0, 0 — A and A? 

Solution: It is clear that 

A n 0 = 0, A U 0 = A, and A - 0 = A. 

From the definition of set difference, we find 0 — A = 0. Finally, A = A. A 

Example 4.3.4 Write, in interval notation, [5,8) U (6,9] and [5,8) fl (6,9]. 

Solution: The answers are 

[5,8) U (6,9] = [5,9], and [5, 8) D (6, 9] = (6, 8). 

They are obtained by comparing the location of the two intervals on the real number line. A 
Hands-On Exercise 4.3.4 Write, in interval notation, (0,3) U [—1,2) and (0,3) fl [—1,2). 


A 

Example 4.3.5 We are now able to describe the following set 

{x £ K. | (x < 5) V [x > 7)} 


in the interval notation. It can be written as either (— oo,5) U (7, oo) or, using complement, 
K — [5,7]. Consequently, saying x ^ [5, 7 ] is the same as saying x € (— oo,5) U (7, oo), or 
equivalently, x € K. — [5, 7]. A 
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Theorem 4.3.1 The following properties hold for any sets A, B, and C in a universal set U. 

1. Commutative properties: A U B = B U A, 

AnB = Bn A. 

2. Associative properties: (A U B) U C = A U (B U C), 

(AnB)nC = An(BnC). 

3. Distributive laws: A U (B fi C) = (iUB)n(4U C), 

A n (b u C) = (A n B) u (A n C). 

4- Idempotent laws: A U A = A, 

A n A = A. 


5. De Morgan’s laws: A U B = An B 7 

AnB = AuB. 

6. Laws of the excluded middle, or inverse laws: AuA = W, 

An A = 0. 


As an illustration, we shall prove the distributive law 

A U (B n C) = (A U B) n (A u C). 


We need to show that 

Va: € U [x £ A U {B n C) <^> x G (AuB)n(AU C)] . 

Equivalently, we need to show that 

AU(BnC)C(AuB)n(AuC), and (AuB)n(AuC)CAu(Bn C). 

Either way, we need to establish the equality in two steps. 

We now present two proofs of the distributive law A U (B n C) = (A U B) n (A U C). 

Proof 1: Let x G A U (B n C). Then x G A, or x £ B n C. We know that x G B n C implies 
that x G B and x £ C. So we have 

(i) x € A or x € B, and 

(ii) x £ A or x € C; 

equivalently, 

(i) x £ A U B, and 

(ii) x £ An C. 

Thus, x £ (AuB)n(AuC). We have proved that Au(BnC) C (AuB)n(AuC'). 

Now let x £ (AuB)n(AuC). Then x £ A U B and x £ A U C. From the definition of 
union, we find 

(i) x £ A or x £ B, and 

(ii) x £ A or x £ C. 

Both conditions require a: £ A, so we can rewrite them as 

(i) x £ A, or 

(ii) x £ B and x £ C; 


equivalently, 
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(i) x £ A, or 

(ii) x £ B C\C. 

Thus, x € A U (B (~l C). This proves that (A U B) D {A U C) C A U (B fl C ). Together with 
Au (B C\ C) C (j4uB)n(i4uC), we conclude that A U (B n C) = (A U B) n {A U C). ■ 

Below is an alternate proof. This type of argument is shorter, but is more symbolic; hence, 
it is more difficult to follow. 


Proof 2: Since 


x £ AU (B (1C) x £ AV x £ (B nc) 

x £ AV (x £ B A x £ C) 

(x £ AV x £ B) A (x £ A\/ x £ C) 
o (x £ A U B) A (x £ A U C) 
x £ (A U B) n {A U C) 


(defn. of union) 

(defn. of intersection) 
(distributive law) 
(defn. of union) 

(defn. of intersection) 


it follows that A U (B n C) 


(AUB)<1 ( AuC ). 


Hands-On Exercise 4.3.5 Prove that A n (B U C) = {A n B) U {A n C) . 


A 


Hands-On Exercise 4.3.6 Prove that if A C B and ACC, then A C B n C. 

Discussion. Let us start with a draft. The statement we want to prove takes the form of 

{A C B) A (A C C) => A C B n C. 

Hence, what do we assume and what do we want to prove? 

Assume: 

Want to Prove: 

Did you put down we assume A C B and ACC, and we want to prove A C B fl Cl Great! 
Now, what does it mean by A C B1 How about A C Cl What is the meaning of A C B D Cl 
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A C B means: For any x £ U, if x £ A, then x G B as well. 

ACC means: 

A C B (~l C means: 

How can you use the first two pieces of information to obtain what we need to establish? 


Now it is time to put everything together, and polish it into a final version. Remember three 
things: 

(i) the outline of the proof, 

(ii) the reason in each step of the main argument, and 

(iii) the introduction and the conclusion. 

Put the complete proof in the space below. 


A 


Here are two results involving complements. 

Theorem 4.3.2 For any two sets A and B, we have A C B <t=> B C A. 

Theorem 4.3.3 (Generalized De Morgan’s Laws) For any sets A, B and C, 

A-(BUC) = {A- B)n(A-C), 

A-(BnC) = (A- B)U(A-C), 

Summary and Review 

• Memorize the definitions of intersection, union, and set difference. We rely on them to 
prove or derive new results. 

• The intersection of two sets A and B , denoted A (~l B, is the set of elements common to 
both A and B. In symbols, Vx € U [x € An B <=$■ (x € A A x € £?)] . 

• The union of two sets A and B , denoted A\J B, is the set that combines all the elements 
in A and B. In symbols, Mx cU\x C AC\ B (x C A\/ x C B)] . 

• The set difference between two sets A and B , denoted by A — B, is the set of elements 
that can only be found in A but not in B. In symbol, it means \/x CU \x C A — B <=>■ (x £ 
A Ax £ £?)] . 

• Know the properties of intersection, union, and set differences listed in Theorem 4.3.1. 

Exercises 4.3 

1. Write each of the following sets by listing its elements explicitly. 


(a) [-4,4] nZ 
(d) (—oo, 4] n N 


(b) (-4,4] nZ 
(e) (—4, oo) n Z 


(c) (—4, oo) n Z 
(f) (4,5)nz 
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2. Assume U = Z, and let 

A = {...,-6, -4, -2, 0,2, 4, 6,...} 

B = {...,-9, -6, -3, 0,3, 6, 9,...} 

C = {...,-12, -8, -4, 0,4, 8, 12,...} 

Describe the following sets by listing their elements explicitly 


(a) Ad B 

(b) C — A 

(c ) A- B 

(d) An5 

(e) B- A 

(f) B U C 

(g) (A u B) n c 

(h) (A U B) - C 


3. Are these statements true or false? 

(a) [1,2] n [2,3] =0 

(b) [1, 2) U (2, 3] = [2,3] 

4. Let the universal set U be the set of people who voted in the 2012 U.S. presidential election. 
Define the subsets D, B, and W of U as follows: 


D 

= {x £ U 

x registered as a Democrat}, 

B 

= {x € U 

x voted for Barack Obama}, 

W 

= {x £ U 

| x belonged to a union}. 


Express the following subsets of U in terms of D , B, and W. 

(a) People who did not vote for Barack Obama. 

(b) Union members who voted for Barack Obama. 

(c) Registered Democrats who voted for Barack Obama but did not belong to a union. 

(d) Union members who either were not registered as Democrats or voted for Barack 
Obama. 

(e) People who voted for Barack Obama but were not registered as Democrats and were 
not union members. 

(f) People who were either registered as Democrats and were union members, or did not 
vote for Barack Obama. 

5. An insurance company classifies its set U of policy holders by the following sets: 


A 

= {x 

| x drives a subcompact car}, 

B 

= {x 

| x drives a car older than 5 years}, 

C 

= 

| x is married}, 

D 

= {x 

| x is over 21 years old}, 

E 

= {x 

| a: is a male}. 


Describe each of the following subsets of U in terms of A, B , C, D , and E. 

(a) Male policy holders over 21 years old. 

(b) Policy holders who are either female or drive cars more than 5 years old. 

(c) Female policy holders over 21 years old who drive subcompact cars. 

(d) Male policy holders who are either married or over 21 years old and do not drive 
subcompact cars. 

6. Let A and B be arbitrary sets. Complete the following statements. 


= 2Z, 
= 3Z, 
= 4Z. 
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(a) ACB<=>-ACB = . 

(b) AC B & AUB = 

(c) AC B A - B = . 

(d) AcB<=t>(A-B = A 5-4/ ). 

(e) A c B (A n B = A A n B ± ). 

(f) A- B = B - A& . 

7. Give examples of sets A and B such that Ac B and A C B. 

8. Prove the De Morgan’s laws. 

9. Let A, B , and C be any three sets. Prove that if A C C and B C C, then A U B C C. 

10. Prove Theorem 4.3.2 

11. Prove Theorem 4.3.3 

12. Let A, B , and C be any three sets. Prove that 

(a) A — B = A n B (b) A = (A - B) U {A n B) 

(c) A - (B — C) = A n (B U C) (d) (A - B) - C = A - (B U C) 

13. Comment on the following statements. Are they syntactically correct? 

(a) xcACxcB=xcAcB 

(b) xcA/\B=>xcAcB 

14. Prove or disprove each of the following statements about arbitrary sets A and B. If you 
think a statement is true, prove it; if you think it is false, provide a counterexample. 

(a) p(A CB) = p(A) n p(B) 

(b) p(A U B) = p(A) U p(B) 

(c) p(A — B) = p(A) - p(B) 

Remark. To show that two sets U and V are equal, we usually want to prove that 
x € U <=> x € V . In this problem, the element x is actually a set. Since we usually use 
uppercase letters to denote sets, we should start the proof of (a) with “Let S C p(AnB).” 
If you prefer to use the alternate approach, it looks like the following: 

Scp(ACB) ... 

... 

SC p{A)Cp{B). 

These remarks also apply to (b) and (c). <0> 

4.4 Cartesian Products 

Another way to obtain a new set from two given sets A and B is to form ordered pairs. An 
ordered pair (x,y) consists of two values x and y. Their order of appearance is important, so 
we call them first and second elements respectively. Consequently, (a, b ) ^ (6, a) unless a = b. 
In general, (a, b) = (c, d) if and only if a = c and b = d. 



4.4 Cartesian Products 


105 


Enclose ordered pairs 
in parentheses. Write 
them as (x,y), and 
not as {x, y}. 


Example 4.4.1 Let A = {John, Jim, Dave} and B = {Mary, Lucy}. Determine Ax B and 
B x A. 

Solution: We find 

Ax B = {(John, Mary), (John, Lucy), (Jim, Mary), (Jim, Lucy), (Dave, Mary), (Dave, Lucy)}, 
B x A = {(Mary, John), (Mary, Jim), (Mary, Dave), (Lucy, John), (Lucy, Jim), (Lucy, Dave)}. 

In general, A x B ^ B x A. A 

Example 4.4.2 Determine Ax B and Ax A: 

(a) A ={1,2} and B = {2,5,6}. 

(b) A= {5} and B = {0,7}. 

Solution: (a) We find 

Ax B = {(1,2), (1,5), (1,6), (2, 2), (2, 5), (2, 6)}, 

Ax A = {(1,1), (1,2), (2,1), (2, 2)}. 

(b) The answers are Ax B = {(5, 0), (5, 7)}, and A x A = {(5, 5)}. A 

Hands-On Exercise 4.4.1 Let A = {a,b,c,d} and B = {r, s, t}. Find A x B, B x A, and 
B x B. 


Definition. The Cartesian product of A and B is the set 

Ax B = {(a, b) | a € A Ab G B}. 

Thus, Ax B (read as “ A cross B n ) contains all the ordered pairs in which the first elements are 
selected from A , and the second elements are selected from B. <C> 


A 


Example 4.4.3 Determine p({l,2}) x {3,7}. Be sure to use correct notation. 

Solution: For a complicated problem, divide it into smaller tasks and solve each one separately. Divide and conquer! 

Then assemble them to form the final answer. In this problem, we first evaluate 

p({l,2}) = {0, {!}, {2}, {1, 2}}. 

This leads to 

p({l>2}) x {3,7} = {0,{1},{2},{1,2}} x {3,7} 

= {(0, 3), (0, 7), ({1}, 3), ({1}, 7), ({2}, 3), ({2}, 7), ({1,2}, 3), ({1,2}, 7)}. 

Check to make sure that we have matching left and right parentheses, and matching left and 
right curly braces. A 
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Hands-On Exercise 4.4.2 Find {a, b, c} x p({d}). 


A 

Example 4.4.4 How could we describe the contents of the Cartesian product [1,3] x {2,4}? 
Since [1,3] is an infinite set, it is impossible to list all the ordered pairs. We need to use the 
set-builder notation: 

[1,3] x {2,4 } = {(x,y) | 1< a: <3,y = 2,4}. 

We can also write [1, 3] x {2, 4} = {(#, 2), (x, 4) | 1 < x < 3}. A 

Hands-On Exercise 4.4.3 Describe, using the set-builder notation, the Cartesian product 
[1,3] x [2,4], 


A 

Cartesian products can be extended to more than two sets. Instead of ordered pairs, we need 
ordered n-tuples. The n-fold Cartesian product of n sets Ai, A 2 , . . . , A n is the set 

Ai x A 2 x • • • x An = {(ai, 02 , ... , a n ) | Oj € Ai for each i, 1 < i < n}. 

In particular, when Ai = A for all i, we abbreviate the Cartesian product as A n . 

Example 4.4.5 The n-dimensional space is denoted R". It is the ?r-fold Cartesian product of 
R. In special cases, R 2 is the xy- plane, and R 3 is the xyz- space. A 

Hands-On Exercise 4.4.4 Let A = {1, 2}, B = {o, 6}, and C = {r, s, t}. Find A x B x C. 


A 


(Ax B) x C, 

A x (B x C) and 
A x B x C are three 
different sets. 


Example 4.4.6 From a technical standpoint, (A x B) x C is different from A x B x C . Can 
you explain why? Can you discuss the difference, if any, between (Ax B) x C and A x (B x C)1 
For instance, give some specific examples of the elements in (Ax B) x C and Ax (B x C) to 
illustrate their differences. 


Solution: The elements of (A x B) x C are ordered pairs in which the first coordinates are 
themselves ordered pairs. A typical element in (A x B) x C takes the form of 

((a,b),c). 

The elements in Ax B x C are ordered triples of the form 


(a,b, c). 
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Since their elements look different, it is clear that (AxB)xCy^AxBxC. Likewise, a typical 
element in A x (B x C) looks like 

(a,(b, c)). 

Therefore, (A x B) x C ^ A x (B x C), and Ax (BxC)^AxBxC. A 

Theorem 4.4.1 For any sets A, B, and C, we have 

Ax(BuC) = (A x B) U (A x C), 

Ax(BnC) = (AxB)n(AxC), 

Ax(B-C) = (AxB)-(AxC). 

Remark. How would we show that the two sets S and T are equal? We need to show that 

x £ S -O- x £ T. 

The complication in this problem is that both S and T are Cartesian products, so x takes on 
a special form, namely, that of an ordered pair. Consider the first identity as an example; we 
need to show that 

(it, v) £ A x (BuCJs (it, v) € (A x B) U {A x C). 

We prove this in two steps: first showing =>, then <=, which is equivalent to first showing C, 
then D. Alternatively, we can use throughout the argument. 

Proof 1: Let ( u,v ) £ A x (B U C). Then u £ A, and v £ B U C. The definition of union 
implies that v £ B or v £ C. Thus far, we have found 

(i) u £ A and v £ B, or 

(ii) u £ A and v £ C. 

This is equivalent to 

(i) (u, v) £ A x J5, or 

(ii) (it, v) £ A x C. 

Thus, (it, v) £ (A x B) U (A x C). This proves that A x (B U C) C (A x B) U (A x C). 

Next, let (it, v) £ (Ax B)U (Ax C). Then (it, v) £ Ax B, or (u,v) £ Ax C. This means 

(i) u £ A and v £ B, or 

(ii) it £ A and v £ C. 

Both conditions require it £ A, so we can rewrite them as 

(i) it £ A, and 

(ii) v £ B or v £ C; 

which is equivalent to 

(i) it £ A, and 

(ii) v £ B U C. 

Thus, (it, it) £ Ax (BUC). We have proved that (Ax B)U(AxC) C Ax(BuC). Together with 
A x (B U C) C (A x B) U (A x C) that we have proved earlier, we conclude that A x (B U C) = 
(AxB)U(AxC). m 

Proof 2: We shall only prove the first equality. Since 

(it, v) £ A x (BA C) u £ A f\v £ (BUC) (defn. of Cartesian product) 

<^u£Af\(v£B\/v£C) (defn. of union) 

<=>(u£AAv£B)\/(u£AAv£C) (distributive law) 

(it, v) £ A x B \J (u, v) £ A x C (defn. of Cartesian product) 

(it, »)e(AxB)U(Ax C) (defn. of union) 

we conclude that A x (B U C) = (A x B) U (A x C). ■ 
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Theorem 4.4.2 If A and B are finite sets, with \A\ = m and \B\ = n, then \ A x B\ = mn. 

Proof: The elements of A x B are ordered pairs of the form (a, b), where a £ A, and b £ B. 
There are m choices of a. For each fixed a , we can form the ordered pair (a, b ) in n ways, because 
there are n choices for b. Together, the ordered pairs (a, b) can be formed in mn ways. I 

The argument we used in the proof is called multiplication principle. We shall study it 
again in Chapter 8. In brief, it says that if a job can be completed in several steps, then the 
number of ways to finish the job is the product of the number of ways to finish each step. 

Corollary 4.4.3 If Ai, A 2 , . . . , A n are finite sets , then \Ai x A 2 x • • • x A n \ = |Ai| • \A 2 \ • • • \A n \. 

Corollary 4.4.4 If A is a finite set with \A\ = n, then |p(A)| = 2 n . 

Proof Let the elements of A be a\, a %, . . . , a n . The elements of p(A) are subsets of A. Each 
subset of A contains some elements from A. Associate to each subset S' of A an ordered n-tuple 
(bi, 62, • • • , b n ) from {0, 1}™ such that 

, / 0 if ai g S, 

1 \ 1 if (ij £ S. 

The value of the ?’th element in this ordered n-tuple indicates whether the subset S contains the 
element a,;. It is clear that the subsets of A are in one-to-one correspondence with the n-tuples. 
This means the power set p(A) and the Cartesian product {0, 1}™ have the same cardinality. 
Since there are 2” ordered n-tuples, we conclude that there are 2" subsets as well. ■ 

This idea of one-to-one correspondence is a very important concept in mathematics. We 
shall study it again in Chapter 6. 

Summary and Review 

• The Cartesian product of two sets A and B , denoted Ax B, consists of ordered pairs of 
the form (a, b), where a comes from A, and b comes from B. 

• Since ordered pairs are involved, Ax B usually is not equal to B x A. 

• The notion of ordered pairs can be extended analogously to ordered n-tuples, thereby 
yielding an n-fold Cartesian product. 

• If A and B are finite sets, then \A x B\ = |A| • \B\. 

Exercises 4.4 

1. Let X = {—2,2}, Y = {0,4} and Z = {—3,0,3}. Evaluate the following Cartesian 
products. 

(a ) XxY (b ) X x Z (c ) ZxY xY 

2. Consider the sets X, Y and Z defined in Problem 1. Evaluate the following Cartesian 
products. 

(a ) XxY xZ (b ) [XxY) xZ (c) X x {Y x Z) 

3. Without listing all the elements of X x Y x X x Z , where X , Y , and Z are defined in 
Problem 1, determine \X x Y x X x Z\. 

4. Determine |p(p(p({l, 2})))|. 

5. Consider the set X = {—2,2}. Evaluate the following Cartesian products. 

(a) X x p(X) (b) p(X) x p(X) 


(c) p(X x X ) 
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6. Let A and B be arbitrary nonempty sets. 

(a) Under what condition does A x B = B x A? 

(b) Under what condition is [A x B) D (B x A) empty? 

7. Let A, B , and C be any three sets. Prove that 

(a) ix(BnC , ) = ( J 4xfl)n(AxC) 

(b) A x (B - C) = (A x B) - {A x C) 

8. Let A, B , and C be any three sets. Prove that if A C B, then A x C C B x C. 

4.5 Index Sets 

The notion of union can be extended to three sets: 

AuBuC = {xeU\ {x £ A) V (x € B) V (a: € C 1 )}. 

It is obvious how to generalize it to the union of any number of sets. We use a notation that 
resembles the summation notation to describe such a union: 

n 

[jA i =A 1 UA 2 U---UA n . 

i = 1 


We define 

n 

= { x G IA | (x G -^i) V (x G ^. 2 ) V • • • V (x G 
It looks messy! Here is a better alternative: 


n 

Ai = {x eu \ x e Ai for some i , where 1 < i < n}. 

i=l 


In a similar manner, HILi ^ = A\ H A 2 fl • • • D A n , and we define 


n 

G| Ai = {x G U | x € Ai for all i, where 1 < i < n}. 
i = 1 


In plain English, (J” Ai is the collection of all elements in the Ai s, and n"=i ^ n is the collection 
of all elements common to all Ai s. 

Example 4.5.1 For i = 1, 2, 3, . . . , let Ai = [— i, *]. First, construct several A j for comparison, 
because it may help us detect any specific pattern. See Figure 4.6. It is clear that A\ C A 2 C • • • . 
Thus, ULr Ai = [— n, n] = A n , and fl"=i Ai = [-1, 1] = -4i- A 

Hands-On Exercise 4.5.1 Evaluate U”=i anc ^ fllLi where Bi = [0,2 i). 


A 






110 


Chapter 4 Sets 


Ay. — i 1 1 4 1 4 1 1 h 

-4 -3-2-10 1 2 3 4 

a 2 ; Afe — 1 ~ f: . c — 

-4-3-2-10 1 2 3 4 

A 3 : -A 4 1 1 1 1 1 4 h 

-4-3-2-10 1 2 3 4 

A 4 : —4 1 1 1 1 1 1 1 4- 

-4 -3-2-10 1 2 3 4 


Figure 4.6: Comparing intervals to find their union and intersection. 


It is obvious that we can also extend the upper bound to infinity. 


OO 


u* 

i= 1 

= Ax U A 2 U • • 

• = {x 

£ U 

\ x £ Ai for some i £ N}, 

00 

n Ai 

i= 1 

= A\ n A2 n • • 

■ = {x 

£ U 

\ x £ Ai for all i £ N}. 


In some situations, we may borrow the idea of partial sums from calculus. We first find the 
union or intersection of the first n sets, then take the limit as n approaches infinity. Thus, if the 
limit is well-defined, then 


oo n oo n 

11-4,= lim I J Ai, and Pi Ai = lim Pi A, : . 

n — »oo 1 1 n— >oo 1 1 

4=1 4= 1 4=1 4=1 


Inclusion or exclusion 
of endpoints may 
change when we take 
limit. 


Example 4.5.2 Let A t = [— i,i]. We have learned from the last example that [J” =1 -4j = 
[— n, n\ and f]"=i A% = [—1,1]. Hence, 

00 OO 

1 J A, = lim [— n,n] = (— 00 , 00 ), and f ] Ai = [—1,1]. 

n— >• 00 11 

i = 1 i = 1 

Recall that we write (— 00 , 00 ) instead of [— 00 , 00 ] because ±00 are not numbers, they are merely 
symbols representing infinitely large values. A 

Hands-On Exercise 4.5.2 Evaluate (J^ B, and HPi-®*' where Bi = [0,2 i). 


First take finite union 
and intersection. 


Then take limit. 


Example 4.5.3 Let Bi = (0, 1 — . Determine UPi Bi and D^i Bi- 

Solution: Once again, we have B\ C B 2 C • • • . It is easy to check that 


[J Bi = B n = ( 0, 1 - — , and Q Bi = B x = f 0, - 


2 n 
4=1 

It follows that 

00 / 1 I 00 / 1" 

=(°’ 1 )’ and n ^=(°’2 • 

i—l x J i—1 ' J 

Note that lim^oo (0, 1 — ^] ^ (0, 1] because the endpoint 1 does not belong to any Bi. 


A 


A 


The last step: check 
the endpoints. 
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Hands-On Exercise 4.5.3 Let C, = [0, 1 — 4 . Determine IJi^i C* and Pl^li A 


A 


Example 4.5.4 Let A = (l - 4, 1 + 4). Determine U“i A and fj^i A- 

Solution: As the value of i increases, the value of - decreases. Hence, the left endpoint 1 — 4 
increases, and the right endpoint 1 + - decreases. 




A: 

1 

O 

i 

A = (l-!,l + i) 

0 

1 

2 

1 

(0,2) 

A: — 

O | Q 


2 

(I 3\ 


1 1 3 

2 1 2 


3 

f2 4N 

V 3 ’ 3/ 

A: — 

e 1 © 

2 1 4 

3 1 3 


4 

(3 5\ 

V 4 ’ 4 / 

A: — 

I e 



It is clear that A 2 A 2 A 2 ■ ■ • . Thus, U2u A = A = (0, 2), and f'|“ 1 A = { 1} - A 

Hands-On Exercise 4.5.4 Let A = [— i, 1+4). Determine (JA, A and HSu A 


A 

Hands-On Exercise 4.5.5 For each positive integer i, define A = {i,i + l,i + 2,...,3i}. 
Determine IJSi F i and A 


A 


The next two results are obvious. 


Theorem 4.5.1 If A\ C C A 3 C • • • , then H^i A = A- 
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Theorem 4.5.2 If A\ D A 2 D A 3 D • • •, f/ien Ufci A = bi- 
llow could we describe the union A 2 U A4 U Aq U • • • ? Well, we can write 

U Alr 

i even 

which means that union of Ai, where i is even. Since the set of even positive integers is denoted 
by 2N, another way to describe the same union is 

IU- 

iS2N 

It means the union all At, where i is taken out from the set 2N. Accordingly, 

OO OO 

u a = u a > and n*= r>- 

;=o iSN »= 0 igN 

We can even go one step further, by allowing i to be taken from any set of integers, or any set 
of real numbers, or even any set of objects. The only restriction is that A,; must exist, and its 
content must somehow depend on i. 

In general, given a nonempty set /, if we could associate with each i £ I a set Ai, we define 
the indexed family of sets A as 

A = {Ai | i £ /}. 

We call / the index set, and define 

|^J Ai = {x | x £ Ai for some i £ I}, 
iei 

P| Ai = {x | x £ Ai for all i £ I}. 
iei 

Let us look at a few examples. 

Example 4.5.5 To describe the union 

A\ U A3 U Ay U An U A23, 

we first define the index set to be I = {1, 3, 7, 11, 23}, which is the set of all the subscripts used 
in the union. Now the union can be conveniently described as U e/ A:. A 

Example 4.5.6 Consider five sets 

Ai = {1,4,23}, 

A 2 = {7,11,23}, 

A 3 = {3,6,9}, 

A 4 = {5,17,22}, 

A 5 = {3,6,23}. 

Let I = {2, 5}, then 

U A = A 2 U A 5 = {7, 11, 23} U {3, 6, 23} = {3, 6, 7, 11, 23}. 
iei 

A 


Likewise, Hie/ A = A 2 n A 5 = {7, 11, 23} n {3, 6, 23} = {23}. 
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Hands-On Exercise 4.5.6 Let J = {1,4,5}. Evaluate (J and fj ieJ Ai, where A;s are 
defined in the last example. 


A 

Hands-On Exercise 4.5.7 An index set could be a set of any objects. For instance, the sets 
of numbers in the last example could be the favorite Lotto numbers of five different students. 
We could index these sets according to the names of the students: 

A John = {1,4,23}, 

^Mary = {7,11,23}, 

Aloe = {3,6,9}, 

A Pete = {5,17,22}, 

^Lucy = {3,6,23}. 

If I = {Mary, Joe, Lucy}, what is (J ieJ ? How would you interpret its physical meaning? 


A 

Example 4.5.7 Let / = {x \ x is a living human being}, and define 

Bi = {x € I | x is a child of i}, 

M = {1} U Bt 

for each i € I ■ Then 

n A i = 0’ u a * = n b > = 0 ’ 

iei iei iei 

and 

Bi = I — [x | x’s parents are both deceased }. 
iei 

We leave it as an exercise to verify these unions and intersections. ▲ 

Hands-On Exercise 4.5.8 Verify the intersection and union in the last example. 


A 
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Hands-On Exercise 4.5.9 If I represents a set of students, and Ai represents the set of friends 
of student i, interpret the meaning of Uie/ A * and Die/ Ai- 


A 


We close this section with yet another generalization of De Morgan’s laws. 

Theorem 4.5.3 (Extended De Morgan’s Laws) For any nonempty index set I, we have 

u = n a%, and n*=u A ‘- 

iei iei iei iei 


Proof 1: Let x G U i6 / A t , then 

x Ai = {x \ x £ Ai for some i £ /}. 
iei 

This means x £ Ai for every i G I. Hence, x G Ai for each iGl. Consequently, 

x G P'1 Ai. 
iei 


This proves that \J ieI A, C f| igJ A,. 

Next, let x G flie/ Ai. Then x G Ai for each i G I. This means x (ji Ai for each iGl. Then 

x ^ {x | x G Ai for some i G 1} = |^J Hj. 

iei 

Thus, x G Uiez Aii proving that A, C (Jj gj Ai. We proved earlier that (J !gj A t C f| igJ A z . 
Therefore, the two sets must be equal. 

The proof of Hie/ Ai — Uie/ Ai proceeds in a similar manner, and is left as an exercise. ■ 


Proof 2: 


We shall prove |J ig7 A, = f| ig/ Ai. 

We leave out the explanations for you to fill in: 

x G (J Ai 

x G PJ Ai 

iei 

iei 


x £ Ai for some i 

<S=> 

x ^ Ai for all i 

<:=> 

x £ Ai for all i 

<:=> 

x £ Ai. 


iei 


The proof of Hie/ Ai — Uie/ Ai is left as an exercise. 
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Summary and Review 

• When dealing with arbitrary intersection or union of intervals, first identify the endpoints, 
then analyze the sets involved in the operation to determine whether an endpoint should 
be included or excluded. 

• Intersection and union can be performed on a group of similar sets identified by subscripts 
belonging to an index set. 

• Consequently, intersection or union can be formed by naming a specific index set. 

Exercises 4.5 

1. For each n £ Z + , define A n = (— E, 2n). Find H^Li A n and U^Li A n- 

2. For each n £ Z + , define B n = {m £ Z | — | < m < 3?r}. Evaluate H^Li B n and IJ^Li B n - 

3. Define C n = {n, n + 1, n + 2, . . . , 2n + 1} for each integer n > 0. Evaluate H^Lo an d 
U ~0 Cn- 

4. For each n £ I = {1, 2, 3, . . . , 100}, define D n = [— n, 2n] D Z. Evaluate (] neI D n and 
Une/ D n - 

5. For each n € N, define E n = {— n, —n+ 1, —n+ 2, . . . , n 2 }. Evaluate E n and [J N E n . 

6 . For each n £ N, define F n = \ m £ Z}. Evaluate P| n£N F„ and U neN F n . 

7. Let / = (0,1), and define A; = [l, A] for each i £ I. For instance A 0.5 = [1,2] and 
A f = [!> y]- Evaluate U , eI A and f | iei A i- 

8 . Define I = (0, 1), and for each i £ I, let = (—i, i). Evaluate [J ieI Bi = (— l,oo) and 
Die/ Bi- 

9. Evaluate fl a: 6 (i l 2)( 1 ~ 2 x,a’ 2 ) and U a: 6 (i, 2)( 1 “ 2 x,;r 2 ). 

10. Evaluate f\ 6 ( o,i) (*, y) and LU(o,i) (®, y)- 

11. Let the universal set be R 2 . For each r £ (0, 00 ), define 

A r = {(a :,y) \ y = rx 2 }-, 

that is, A r is the set of points on the parabola y = rx 2 , where r > 0. Evaluate f)re(o 00) Ar 
and Ur 6 ( 0 ,oo) A r- 


12. Prove that f"'| Aj = |^J A,; for any nonempty index set /. 
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Chapter 5 


Basic Number Theory 


5.1 The Principle of Well-Ordering 

Number theory studies the properties of integers. Some basic results in number theory rely on 
the existence of a certain number. The next theorem can be used to show that such a number 
exists. 

Theorem 5.1.1 (Principle of Well-Ordering) Every nonempty subset of N has a smallest 
element. 

The idea is rather simple. Start with the integer 1. If it belongs to S, we are done. If not, 
consider the next integer 2, and then 3, and so on, until we find the first element in S. However, 
like the principle of mathematical induction, it is unclear why “and so on” is possible. In fact, 
we cannot prove the principle of well-ordering with just the familiar properties that the natural 
numbers satisfy under addition and multiplication. Hence, we shall regard the principle of well- 
ordering as an axiom. Interestingly though, it turns out that the principle of mathematical 
induction and the principle of well-ordering are logically equivalent. 

Theorem 5.1.2 The principle of mathematical induction holds if and only if the principle of 
well-ordering holds. 

Proof: (=>) Suppose .S' is a nonempty set of natural numbers that has no smallest element. 
Let 

R = {x € N | x < s for every s € S}. 

Since S does not have a smallest element, it is clear that R (~l S = 0. It is also obvious that 
1 £ R. Assume k £ R. Then any natural number less than or equal to k must also be less than 
or equal to s for every s G S. Hence 1, 2, . . . , k £ R. Because R fl S = 0, we find 1, 2, . . . , k ^ S. 
If k + 1 £ S, then k + 1 would have been the smallest element of S. This contradiction shows 
that k + 1 € R. Therefore, the principle of mathematical induction would have implied that 
R = N. That would make S an empty set, which contradicts the assumption that S is nonempty. 
Therefore, any nonempty set of natural numbers must have a smallest element. 

(*t=) Let S' be a set of natural numbers such that 

(i) 1 e S, 

(ii) For any k > 1, if k £ S, then k + 1 £ S. 

Suppose 5 / N. Then S = N — S ^ 0. The principle of well-ordering states that S has a 
smallest element z. Since 1 £ S, we deduce that z > 2, which makes z — 1 > 1. The minimality 
of z implies that z — 1 ^ S. Hence, z — 1 € S. Condition (ii) implies that z £ S, which is a 
contradiction. Therefore, S = N. ■ 
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The principle of well-ordering is an existence theorem. It does not tell us which element is 
the smallest integer, nor does it tell us how to find the smallest element. 

Example 5.1.1 Consider the sets 

A = {n € N | n is a multiple of 3}, 

B = {n € N | n = —11 + 7m for some m € Z}, 

C = {n € N | n = x 2 — 8a: + 12 for some x € Z}. 

It is easy to check that all three sets are nonempty, and since they contain only positive integers, 
the principle of well-ordering guarantees that each of them has a smallest element. 

These smallest elements may not be easy to find. It is obvious that the smallest element in A 
is 3. To find the smallest element in B , we need —11 + 7m > 0, which means m > 11/7 ss 1.57. 
Since m has to be an integer, we need m > 2. Since —11 + 7 m is an increasing function in m, 
its smallest value occurs when m = 2. The smallest element in B is —11 + 7-2 = 3. 

To determine the smallest element in C, we need to solve the inequality x 2 — 8a; + 12 > 0. 
Factorization leads to x 2 — 8x + 12 = (x — 2)(x — 6) > 0, so we need x < 2 or x > 6. Because 
we determine that the minimum value of x 2 — 8a; + 12 occurs at x = 1 or x = 7. Since 

l 2 - 8 • 1 + 12 = 7 2 - 8 • 7 + 12 = 5, 


The smallest element in C is 5. 


▲ 


Two sets that do not 
have a smallest 
element. 


Example 5.1.2 The principle of well-ordering may not be true over real numbers or negative 
integers. In general, not every set of integers or real numbers must have a smallest element. 
Here are two examples: 


• The set Z. 

• The open interval (0, 1). 

The set Z has no smallest element because given any integer x, it is clear that x — 1 < x, and 
this argument can be repeated indefinitely. Hence, Z does not have a smallest element. 

A similar problem occurs in the open interval (0, 1). If x lies between 0 and 1, then so is |, 
and | lies between 0 and x, such that 

X 

0 < x < 1 => 0 < — < x < 1. 


This process can be repeated indefinitely, yielding 

0 < 


x 

< — < 
2 n 


xxx 

< A < »< 2 <X<1 - 


We keep getting smaller and smaller numbers. All of them are positive and less than 1. There 
is no end in sight, hence the interval (0, 1) does not have a smallest element. ▲ 


The idea behind the principle of well-ordering can be extended to cover numbers other than 
positive integers. 


Definition. A set T of real numbers is said to be well-ordered if every nonempty subset of 
T has a smallest element. <C> 


Therefore, according to the principle of well-ordering, N is well-ordered. 
Example 5.1.3 Show that Q is not well-ordered. 


A proof by 
contradiction. 


Solution: Suppose x is the smallest element in Q. Then x — 1 is a rational number that is 
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smaller than x, which contradicts the minimality of x. This shows that Q does not have a 
smallest element. Therefore Q is not well-ordered. ▲ 

Hands-On Exercise 5.1.1 Show that the interval [0, 1] is not well-ordered by finding a subset 
that does not have a smallest element. 


A 


Summary and Review 

• A set of real numbers (which could be decimal numbers) is said to be well-ordered if every 
nonempty subset in it has a smallest element. 

• A well-ordered set must be nonempty and have a smallest element. 

• Having a smallest element does not guarantee that a set of real numbers is well-ordered. 

• A well-ordered set can be finite or infinite, but a finite set is always well-ordered. 

Exercises 5.1 

1. Find the smallest element in each of these subsets of N. 

(a) {n € N | n = m 2 — 10m + 28 for some integer m} 

(b) {n e N | n = 5q + 3 for some integer q } 

(c) {n € N | n = —150 — 17 d for some integer d} 

(d) {n € N | n = 4s + 9t for some integers s and t} 

2. Determine which of the following subsets of R are well-ordered: 

(a) {} 

(b) {-9, -7, -3, 5, 11} 

(c) {0}UQ+ 

(d) 2Z 

(e) 5N 

(f) {-6, -5, -4,...} 

3. Show that the interval [3, 5] is not well-ordered. 

Hint: Find a subset of [3, 5] that does not have a smallest element. 

4. Assume 0 ^ li C T 2 C R. Show that if T 2 is well-ordered, then T\ is also well-ordered. 

Hint: Let S' be a nonempty subset of T\. We want to show that S has a smallest element. 

To achieve this goal, note that T\ C T 2 . 

5. Prove that 2N is well-ordered. 

Hint: Use Problem 4 

6. Assume 0 ^ Ti C T 2 C M. Prove that if T\ does not have a smallest element, then T 2 is 
not well-ordered. 

5.2 Division Algorithm 

When we divide a positive integer (the dividend) by another positive integer (the divisor), we 
obtain a quotient. We multiply the quotient to the divisor, and subtract the product from 
the dividend to obtain the remainder. Such a division produces two results: a quotient and a 
remainder. 
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Step 1: Show that 
there exist q,r £ Z 
such that b — aq + r. 


This is how we normally divide 23 by 4: 


5 

4) 23 
20 

3"" 

In general, the division b 4- a takes the form 

Q 

a ) b 
aq 
r 

so that r = b — aq, or equivalently, b = aq + r. Of course, both q and r are integers. Yet, the 
following “divisions” 

4 2 6 7 

4 ) 23 4 ) 23 4 ) 23“ 4) 23~~ 

16 8 24 28 

is -i 

also satisfy the requirement b = aq + r, but that is not what we normally do. This means having 
b = aq + r alone is not enough to define what quotient and remainder are. We need a more rigid 
definition. 

Theorem 5.2.1 (Division Algorithm) Given any integers a and b, where a > 0, there exist 
integers q and r such that 

b = aq + r, 

where 0 < r < a. Furthermore, q and r are uniquely determined by a and b. 

The integers b, a, q, and r are called the dividend, divisor, quotient, and remainder, 
respectively. Notice that b is a multiple of a if and only if r = 0. 

The division algorithm describes what happens in long division. Strictly speaking, it is not 
an algorithm. An algorithm describes a procedure for solving a problem. The theorem does not 
tell us how to find the quotient and the remainder. Some mathematicians prefer to call it the 
division theorem. Here, we follow the tradition and call it the division algorithm. 

Remark. This is the outline of the proof: 

1. Describe how to find the integers q and r such that b = aq + r. 

2. Show that our choice of r satisfies 0 < r < a. 

3. Establish the uniqueness of q and r. 

Regarding the last part of the proof: to show that a certain number x is uniquely determined, 
a typical approach is to assume that x' is another choice that satisfies the given condition, and 
show that we must have x = x' . <C> 

Proof: We first show the existence of q and r. Let 

S = {b — ax | x € Z and b — ax > 0}. 

Clearly, S' is a set of nonnegative integers. To be able to apply the principle of well-ordering, 
we need to show that S is nonempty. Here is a constructive proof. 

• Case 1. If b > 0, we can set x = 0. Then b — ax = b > 0. 
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• Case 2. If b < 0, set x = b. Since a > 1, we have 1 — a < 0. Then 

b — ax = b — ab = 6(1 — a) > 0. 


Since S is nonempty, it follows from the principle of well-ordering that S has a smallest element. 
Call it r. From the definition of S, there exists some integer q such that b — aq = r. 

Next, we show that 0 < r < a. The definition of S tells us immediately that r > 0, so we 
only need to show that r < a. Suppose, on the contrary, r > a. Then r = a + t for some integer 
t > 0. Now b — aq = r = a + t implies that 

0 <t = b — aq — ci = b — a(q + 1). 

So t £ S. Now t = r — a < r suggests that we have found another element in S which is even 
smaller than r. This contradicts the minimality of r. Therefore r < a. 

Finally, we have to establish the uniqueness of both q and r. Let q' and r' be integers such 
that 

b = aq' + r' f 0 < r' < a. 

From aq + r = b = aq' + r' , we find a(q — q') = r' — r. Hence 

a\q-q'\ = \r' - r\. 

Since | r' — r\ is an integer, if | r' — r\ ^ 0, we would have a < \r' — r|. From 0 < r,r' < a , we 
deduce that | r' — r| < a, which clearly contradicts our observation that a < \r' — r\. Hence, 
| r' — r\ =0. Then r' = r. It follows that q' = q. So the quotient q and the remainder r are 
unique. ■ 


You should not have any problem dividing a positive integer by another positive integer. 
This is the kind of long division that we normally perform. It is more challenging to divide a 
negative integer by a positive integer. When b is negative, the quotient q will be negative as 
well, but the remainder r must be nonnegative. In a way, r is the deciding factor: we choose q 
such that the remainder r satisfies the condition 0 < r < a. 

In general, for any integer b, dividing b by a produces a decimal number. If the result is not 
an integer, round it down to the next smaller integer (see Example 6.1.3). It is the quotient q 
that we want, and the remainder r is obtained from the subtraction r = b — aq. For example, 


-22 

~T 


-3.1428... . 


Rounding it down produces the quotient q = —4, and the remainder is r = —22 — 7(— 4) = 6; 
and we do have —22 = 7 • (—4) + 6. 


Hands-On Exercise 5.2.1 Compute the quotients q and the remainders r when b is divided 
by a: 

(a) b = 128, a = 7 (b) b = -128, a = 7 (c) b = -389, a = 16 

Be sure to verify that b = aq + r. 


A 


Step 2: Show that r 
satisfies the criterion 
0 < r < a. 


Step 3: Establish the 
uniqueness of q and r 


The division algorithm can be generalized to any nonzero integer a. 
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Corollary 5.2.2 Given any integers a and b with a ^ 0, there exist uniquely determined integers 
q and r such that b = aq + r, where 0 < r < |a| . 

Proof: We only have to consider the case of a < 0. Since —a > 0, the original Euclidean 
Algorithm assures that there exist uniquely determined integers q' and r such that 

b = (-a) -q' + r, 

where 0 < r < —a = \a\. Therefore, we can set q = —q'. ■ 


Remember: what 
really matters is the 
choice of r; it must 
satisfy 0 < r < |a|. 


Example 5.2.1 Not every calculator or computer program computes q and r the way we want 
them done in mathematics. The safest solution is to compute |6| = |a| in the usual way, inspect 
the remainder to see if it fits the criterion 0 < r < |a|. If necessary, adjust the value of q so that 
the remainder r satisfies the requirement 0 < r < |a|. Here are some examples: 


b 

a 

b = aq + r 

q 

r 

14 

4 

14 = 4 • 3 + 2 

3 

2 

-14 

4 

-14 = 4- (-4) +2 

-4 

2 

-17 

-3 

-17 = (-3) -6 + 1 

6 

1 

17 

-3 

17 = (-3) -(-5) + 2 

-5 

2 


The quotient q can be positive or negative, and the remainder r is always nonnegative. A 


Definition. Given integers a and 6, with a ^ 0, let q and r denote the unique integers such 
that b = aq + r, where 0 < r < |a|. Define the binary operators div and mod as follows: 

b div a = q, 
b mod a = r. 

Therefore, b div a gives the quotient, and b mod a yields the remainder of the integer division 
b fr a. Recall that b div a can be positive, negative, or even zero. But b mod a is always a 
nonnegative integer less than |a|. <C> 


Example 5.2.2 From the last example, we have 

(a) 14 div 4 = 3, and 14 mod 4 = 2. 

(b) —14 div 4 = —4, and —14 mod 4 = 2. 

(c) —17 div —3 = 6, and —17 mod —3 = 1. 

(d) 17 div —3 = —5, and 17 mod —3 = 2. 

Do not forget to check the computations, and remember that a need not be positive. A 

Hands-On Exercise 5.2.2 Complete the following table: 


b 

a 

b div a 

b mod a 

334 

15 



334 

-15 



-334 

15 



-334 

-15 




Do not forget: b mod a is always nonnegative. A 

Example 5.2.3 Let n be an integer such that 

n div 6 = q 1 and n mod 6 = 4. 

Determine the values of (2 n + 5) div 6, and (2 n + 5) mod 6. 
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Solution: The given information implies that n = 6q + 4. Then 

2n + 5 = 2(6 q + 4) + 5 = 12q + 13 = 6(2 q + 2) + 1. 

Therefore, (2 n + 5) div 6 = 2q + 2, and (2 n + 5) mod 6 = 1. A 

Hands-On Exercise 5.2.3 Let n be an integer such that 

n div 11 = q, and n mod 11 = 5. 

Compute the values of (6 n — 4) div 11 and (6?r — 4) mod 11. 


A 

Example 5.2.4 Suppose today is Wednesday. Which day of the week is it a year from now? 

Solution: Denote Sunday, Monday, ... , Saturday as Day 0, 1, . . . 6, respectively. Today is 
Day 3. A year (assuming 365 days in a year) from today will be Day 368. Since 

368 = 7 • 52 + 4, 

it will be Day 4 of the week. Therefore, a year from today will be Thursday. A 

Hands-On Exercise 5.2.4 Suppose today is Friday. Which day of the week is it 1000 days 
from today? 


A 


Any integer divided by 7 will produce a remainder between 0 and 6, inclusive. Define 
Ai = {x € Z | x mod 7 = i} for 0 < i < 6, 


we find 

Z = A 0 U A x U A 2 U A 3 U A 4 U A 5 U A 6 , 
where the sets A, are pairwise disjoint. The collection of sets 

{A 0l A 1 ,A 2 , A 3 , Ai, A 5 , A e } 

is called a partition of Z, because every integer belongs to one and only one of these seven 
subsets. We also say that Z is a disjoint union of Ao, A\, . . . , Aq. The same argument also 
applies to the division by any integer n > 2. 

In general, a collection or family of finite sets {Si,S 2 , ■ ■ ■ , S n } is called a partition of the set 
S' if S' is the disjoint union of Si, S 2 , . . . S n . Partition is a very important concept, because it 
divides the elements of S into n classes S±, S 2 , . . . , S n such that every element of S belongs to 
a unique class. We shall revisit partition again when we study relations in Chapter 7. 
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Summary and Review 

• The division of integers can be extended to negative integers. 

• Given any integer b , and any nonzero integer a, there exist uniquely determined integers 
q and r such that b = aq + r, where 0 < r < |a|. 

• We call q the quotient, and r the remainder. 

• The reason we have unique choices for q and r is the criterion we place on r. It has to 
satisfy the requirement 0 < r < |a|. 

• In fact, the criterion 0 < r < |a| is the single most important deciding factor in our choice 
of q and r. 

• We define two binary operations on integers. The div operation yields the quotient, and 
the mod operation produces the remainder, of the integer division b 4- a. In other words, 
b div a = q, and b mod a = r. 


Exercises 5.2 


1. Find b div a and b mod a, 

(a) a = 13, b = 300 

2. Find b div a and b mod a, 

(a) a = 19, b = 79 
(d) a = -16, b= 172 


where 

(b) o = 11, 6= -120 
where 

(b) a = 59, b = 18 
(e) a = —8, b = —67 


(c) a = -22, b = 145 


(c) a = 16, b = -823 
(f) a = -12, b = —134 


3. Prove that 

b mod a € {0, 1, 2, ... , |a| — 1} 
for any integers a and b , where a ^ 0. 

4. Prove that among any three consecutive integers, one of them is a multiple of 3. 

Hint : Let the three consecutive integers be n, n + 1, and n + 2. What are the possible 
values of n mod 3? What does this translate into, according to the division algorithm? In 
each case, what would n, n + 1, and n + 2 look like? 

5. Prove that n 3 — n is always a multiple of 3 for any integer n by 

(a) A case-by-case analysis. 

(b) Factoring n 3 — n. 

6. Prove that the set {n, n + 4, n + 8, n + 12, n + 16} contains a multiple of 5 for any positive 
integer n. 

7. Let m and n be integers such that 

m div 5 = s, m mod 5 = 1, n div 5 = i, n mod 5 = 3. 


Determine 


(a) (m + n) div 5 
(c) (mn) div 5 


(b) ( m + n ) mod 5 
(d) (mn) mod 5 


8. Let m and n be integers such that 

m div 8 = s, m mod 8 = 3, n div 8 = i, n mod 8 = 6. 


Determine 
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(a) 

(m + 2) div 8 

(b) (to + 2) mod 8 

(c) 

(3 rrm) div 8 

(d) (3 mn) mod 8 

(e) 

(5m + 2 n) div 8 

(f) (5 to + 2 n) mod 8 

(g) 

(3m. — 2n) div 8 

(h) (3 to — 2 n) mod 8 


5.3 Divisibility 

In this section, we shall study the concept of divisibility. Let a and b be two integers such that 
a y 0. The following statements are equivalent: 

• a divides b , 

• a is a divisor of b, 

• a is a factor of b , 

• b is a multiple of a, and 

• b is divisible by a. 

They all mean 

There exists an integer q such that b = aq. 

In terms of division, we say that a divides b if and only if the remainder is zero when b is divided 
by a. We adopt the notation 

a | b [pronounced as “a divides &”] 

Do not use a forward slash / or a backward slash \ in the notation. To say that a does not 
divide 6, we add a slash across the vertical bar, as in 

a\b [pronounced as “a does not divide 6”] 

Do not confuse the notation a \ b with | . The notation | represents a fraction. It is also written 

as a/b with a (forward) slash. It uses floating-point (that is, real or decimal) division. For 

example, -j- = 2.75. 

The definition of divisibility is very important. Many students fail to finish very simple 
proofs because they cannot recall the definition. So here we go again: 

a | b <t=> b = aq for some integer q. 

Both integers a and b can be positive or negative, and b could even be 0. The only restriction 
is a 0. In addition, q must be an integer. For instance, 3 = 2 • |, but it is certainly absurd to 
say that 2 divides 3. 

Example 5.3.1 Since 14 = (—2) • (—7), it is clear that —2 | 14. A 

Hands-On Exercise 5.3.1 Verify that 

5 | 35, 8 {35, 25 {35, 7 | 14, 2 | —14, and 14 | 14, 

by finding the quotient q and the remainder r such that b = aq + r, and r = 0 if a \ b. 


Memorize this 
definition! 


Write a \ b, not a/b 
or a\b. 


Memorize this 
definition. 


A 
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The definition of even 
and odd integers. 


0 is divisible by any 
nonzero integer. 


Example 5.3.2 An integer is even if and only if it is divisible by 2, and it is odd if and only 
if it is not divisible by 2. A 

Hands-On Exercise 5.3.2 What is the remainder when an odd integer is divided by 2? Com- 
plete the following sentences: 

• If n is even, then n = for some integer . 

• If n is odd, then n = for . 

Memorize them well, as you will use them frequently in this course. A 

Hands-On Exercise 5.3.3 Complete the following sentence: 

• If n is not divisible by 3, then n = , or n = , for some integer . 

Compare this to the div and mod operations. What are the possible values of n mod 3? A 

Example 5.3.3 Given any integer a / 0, we always have a | 0 because 0 = a ■ 0. In particular, 
0 is divisible by 2, hence, it is considered an even integer. A 

Example 5.3.4 Similarly, ±1 and ±b divide b for any nonzero integer b. They are called the 
trivial divisors of a. A divisor of b that is not a trivial divisor is called a nontrivial divisor 
of b. 

For example, the integer 15 has eight divisors: ±1, ±3, ±5, ±15. Its trivial divisors are ±1 
and ±15, and the nontrivial divisors are ±3 and ±5. A 

Definition. A positive integer a is a proper divisor of b if a \ b and a < |6|. If a is a proper 
divisor of b, we say that a divides b properly. <£> 

Remark. Some number theorists include negative numbers as proper divisors. In this con- 
vention, a is a proper divisor of b if a \ b. and |o| < |6|. To add to the confusion, some number 
theorists exclude ±1 as proper divisors. Use caution when you encounter these terms. 

Example 5.3.5 It is clear that 12 divides 132 properly, and 2 divides —14 properly as well. 
The integer 11 has no proper divisor. A 

Hands-On Exercise 5.3.4 What are the proper divisors of 132? 


A 

Definition. An integer p > 1 is a prime if its positive divisors are 1 and p itself. Any integer 
greater than 1 that is not a prime is called composite. <C> 

Remark. A positive integer n is composite if it has a divisor d that satisfies 1 < d < n. Also, 
according to the definition, the integer 1 is neither prime nor composite. <0> 

Example 5.3.6 The integers 2, 3, 5, 7, 11, 13, 17, 19, 23, . . . are primes. A 

Hands-On Exercise 5.3.5 What are the next five primes after 23? 


A 
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Theorem 5.3.1 There are infinitely many primes. 

Proof: We postpone its proof to a later section, after we prove a fundamental result in number 
theory. ■ 

Theorem 5.3.2 For all integers a, b, and c where a 0, we have 

(1) If a | b, then a \ xb for any integer x. 

(2) If a | b and b \ c, then a | c. (This is called the transitive property of divisibility.) 

(3) If a | b and a \ c, then a | ( sb + tc) for any integers x and y. (The expression sb + tc is 
called a linear combination of b and c.) 

(4) If b ^ 0 and a \ b and b \ a, then a = ±b. 

(5) If a | b and a,b> 0, then a<b. 

Proof: We shall only prove (1), (4), and (5), and leave the proofs of (2) and (3) as exercises. 

Proof of (1). Assume a \ b , then there exists an integer q such that b = aq. For any integer x, 
we have 

xb = x ■ aq = a ■ xq, 

where xq is an integer. Hence, a \ xb. 

Proof of (4). Assume a \ b, and b \ a. Then there exist integers q and q' such that b = aq, and 
a = bq' . It follows that 

a = bq 1 = aq ■ q' . 

This implies that qq' = 1. Both q and q' are integers. Thus, each of them must be either 1 or 
— 1, which makes b = ±a. 

Proof of (5). Assume a \ b and a, b > 0. Then b = aq for some integer q. Since a, b > 0, we also 
have q > 0. Being an integer, we must have q > 1. Then b = aq > a ■ 1 = a. ■ 

Example 5.3.7 Use the definition of divisibility to show that given any integers a, b, and c, 
where a ^ 0, if a \ b and a \ c, then a \ (sb 2 + tc 2 ) for any integers s and t. 


Solution: We try to prove it from first principles, that is, using only the definition of divisibility. 
Here is the complete proof. 


Assume a \ b and a \ c. There exist integers x and y such that b = ax and c = ay. 
Then 

sb 2 + tc 2 = s(ax) 2 + t(ay) 2 = a(sax 2 + tay 2 ), 
where sax 2 + tay 2 is an integer. Hence a \ (sb 2 + tc 2 ). 


The key step is substituting b = ax and c = ay into the expression sb 2 + tc 2 . You may ask, how 
can we know this is the right thing to do? 

Here is the reason. We want to show that a \ (sb 2 + tc 2 ). This means we need to find an 
integer which, when multiplied by a, yields sb 2 +tc 2 . This calls for writing sb 2 +tc 2 as a product 
of a and another integer that is yet to be determined. Since s and t bear no relationship to 
a, our only hope lies in b and c. We do know that b = ax and c = ay, therefore, we should 
substitute them into sb 2 +tc 2 . A 
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Hands-On Exercise 5.3.6 Let a, b , and c be integers such that a ^ 0. Prove that if a \ b or 
a | c, then a \ be. 


A 


Summary and Review 

• An integer b is divisible by a nonzero integer a if and only if there exists an integer q such 
that b = aq. 

• An integer n > 1 is said to be prime if its only divisors are ±1 and ±n; otherwise, we say 
that n is composite. 

• If a positive integer n is composite, it has a proper divisor d that satisfies the inequality 
1 < d < n. 

Exercises 5.3 

1. Let a, b , and c be integers such that a ^ 0. Use the definition of divisibility to prove that 
if a | b and c | (—a), then (— c) | b. Use only the definition of divisibility to prove these 
implications. 

2. Let a, b , c, and d be integers with a, c ^ 0. Prove that 

(a) If a | b and c | d , then ac \ bd. 

(b) If ac | be, then a | b. 

3. Let a, b, and c be integers such that a, b ^ 0. Prove that if a \ b and b \ c, then a | c. 

4. Let a, b, and c be integers such that a ^ 0. Prove that if a \ b and a \ c, then a | ( sb + tc) 

for any integers s and t. 

5. Prove that if n is an odd integer, then n 2 — 1 is divisible by 4. 

6. Use the result from Problem 5 to show that none of the numbers 11, 111, 1111, and 11111 
is a perfect square. Generalize, and prove your conjecture. 

Hint : Let x be one of these numbers. Suppose a: is a perfect square, then x = n 2 for some 
integer n. How can you apply the result from Problem 5? 

7. Prove that the square of any integer is of the form 3 k or 3fc + 1. 

8. Use Problem 7 to prove that 3 to 2 — 1 is not a perfect square for any integer m. 

9. Use induction to prove that 3 | (2 2n — 1) for all integers n > 1. 

10. Use induction to prove that 8 | (5 2n + 7) for all integers n > 1. 

11. Use induction to prove that 5 | (n 5 — n) for all integers n > 1. 

12. Use induction to prove that 5 | (3 3n+1 + 2” +1 ) for all integers n > 1. 




5.4 Greatest Common Divisors 


129 


5.4 Greatest Common Divisors 

Given any two integers a and b , an integer c 0 is a common divisor or common factor of 
a and b if c divides both a and b. If, in addition, a and b are not both equal to zero, then the 
greatest common divisor , denoted by gcd(a, 6), is defined as the largest common divisor of 
a and b. Greatest common divisors are also called highest common factors. It should be clear 
that gcd(a, b) must be positive. 

Example 5.4.1 The common divisors of 24 and 42 are ±1, ±2, ±3, and ±6. Among them, 6 
is the largest. Therefore, gcd(24, 42) = 6. The common divisors of 12 and 32 are ±1, ±2 and 
±4, it follows that gcd(12,32) = 4. 

Hands-On Exercise 5.4.1 Verify that 

gcd(5, 35) = 5, gcd(— 5, 10) = 5, gcd(20, -10) = 10, and gcd(20, 70) = 10. 

Explain why gcd(3, 5) = 1. 


A 


Example 5.4.2 Can you explain why gcd(0, 3) = 3? How about gcd(0, —3) = 3? 

Solution: Recall that 0 is divisible by any nonzero integer. Hence, all the divisors of 3 are also 
divisors of 0. Obviously, 3 itself is the largest divisor of 3. Therefore, gcd(0, 3) = 3. ▲ 

Hands-On Exercise 5.4.2 Explain why gcd(0, — 8) = 8. 


A 


Theorem 5.4.1 For any nonzero integer b, we have gcd(0, b) = |6|. 

Proof: The largest positive divisor of b is |6|. Since |6| also divides 0, we conclude that 
gcd(0, b) = |6|. ■ 

Theorem 5.4.1 tells us that gcd(0, b) = |6| if b is nonzero. From the definition of common 
divisor and greatest common divisor, it is clear that gcd(a, b) = gcd(6, a), and gcd(a, b) = 
gcd(±a, ±6). So we may assume 1 < a <b. 

Theorem 5.4.2 Let a and b be integers such that 1 < a < b. If b = aq + r, where 0 < r < a, 
then gcd(6, a) = gcd(a, r) . 

Proof: To facilitate our argument, let d = gcd (b,a) and e = gcd(a,r). By definition, d is a 
divisor of both b and a. Therefore, b = dx and a = dy for some integers x and y. Then 

r = b — aq = dx — dy ■ q = d(x — yq), 

where x — yq is an integer. Hence, d \ r. This makes d a common divisor of both r and a. Since 
e is the greatest common divisor of a and r, we determine that d < e. 


gcd(a, b) is always 
positive. 


If b 0, then 
gcd(0, b) =.\b\. 


Assume 1 < a < b. 


The main theorem. 
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Similarly, e = gcd(a, r) is a divisor of both a and r. Thus, a = eu and r = ev for some 
integers u and v. Then 

b = aq + r = a ■ eu + ev = e(au + v), 

where au + v is an integer. Hence, e | b. This makes e a common divisor of both b and a. Since 
d is the greatest common divisor of b and a, we deduce that e < d. Together with d < e, we 
conclude that d = e. ■ 


Example 5.4.3 From 997 = 996 • 1 + 1, we obtain gcd(997, 996) = gcd(996, 1) = 1. ▲ 


The theorem assures that gcd(6, a) = gcd(a, r). We can apply the theorem again to gcd(a, r). 
Dividing a by r produces a new quotient and a new remainder. If necessary, repeat the process 
until the remainder becomes zero. If we denote b = rg and a = ry , then 

I’O = 

r iqi +r 2 , 

0 < r 2 < ri , 

r i = 

T 2<?2 + r 3 , 

0 < r 3 < r 2 , 

?’2 = 

r 3 q 3 + r 4 , 

0 < r 4 < r 3 , 

-i 

?r 

1 

II 

rkqk + r k + i, 

0 < r k + i < Tfc, 

r n - 3 = 

Tn— 2<Zn— 2 T r n — 1 , 

0 < r„_ i < r n - 2 , 

r„~ 2 = 

^n—lQn—l T T n , 

r„ = 0. 

It follows that 



gcd(6, a) = gcd(r 0 ,rr) 

= gcd(n,r 2 ) = • • • = 

= gcd(r„_i,r„) = gcd(r„_i,0) = r„ j . 


The last nonzero remainder is gcd(a, 6). This method for finding the greatest common divisor 
is called Euclidean algorithm. 

Example 5.4.4 Find gcd(426, 246). 

Solution: By applying the theorem repeatedly, we find 


426 = 

246 • 1 + 180, 

gcd(426, 246) 

= gcd(246, 180) 

246 = 

180 • 1 + 66, 

gcd(246, 180) 

= gcd(180,66) 

180 = 

66 • 2 + 48, 

gcd(180, 66) 

= gcd(66, 48) 

66 = 

48 • 1 + 18, 

gcd(66, 48) 

= gcd(48, 18) 

48 = 

18-2 + 12, 

gcd(48, 18) 

= gcd(18, 12) 

18 = 

12-1 + 6, 

gcd(18, 12) 

= gcd(12, 6) 

12 = 

6-2 + 0, 

gcd(12, 6) 

= gcd(6, 0) = 6. 


Therefore, gcd(426, 246) = 6. ▲ 

Hands-On Exercise 5.4.3 Determine gcd(732, 153). 


A 
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Hands-On Exercise 5.4.4 Determine gcd(6958, 2478). 


A 


By hand, it is more efficient to use a two-column format. First, put the two numbers 426 
and 246 in two separate columns, with the larger number on the left. Perform a short division, 
and write the quotient on the left: 


426 

246 

246 


180 



In the next round, perform another short division on the two numbers 246 and 180 at the 
bottom. Since the larger number is now on the right column, leave the quotient to its right: 


1 

426 

246 


246 

180 


180 

66 


Continue in this manner until the remainder becomes 0. The last nonzero entry at the bottom 
is the greatest common divisor. We can also leave all the quotients on the left: 


1 

426 

246 

1 1 

426 

246 


246 

180 

1 

246 

180 

2 

180 

66 

1 2 

180 

66 


132 

48 

1 

132 

48 

2 

48 

18 

1 or 2 

48 

18 


36 

12 

1 

36 

12 

2 

12 

6 

2 

12 

6 


12 



12 



0 



0 



Hands-On Exercise 5.4.5 Use the two-column format to compute gcd(153, 732). 


A 
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Hands-On Exercise 5.4.6 Use the two-column format to compute gcd(6958, 2478). 


A 

Given any integers m and n, the numbers of the form ms + nt, where s,t are integers, are 
called the linear combinations of m and n. They play an important role in the study of 
gcd(m, n), as indicated in the next theorem. 

Theorem 5.4.3 For any nonzero integers a and b, there exist integers s and t such that 
gcd(a, b) = as + bt. 

Proof: The proof of this theorem is lengthy and complicated. We leave it, along with other 
related results, many of which are rather technical, to the next section. ■ 

Theorem 5.4.4 Every linear combination of a and b is a multiple o/gcd(a, b). 

Corollary 5.4.5 The greatest common divisor of two nonzero integers a and b is the smallest 
positive integer among all their linear combinations. 

It is important to understand what these three results say. Finding a linear combination of 
a and b only gives us a multiple of gcd(a, b). Only a special linear combination will produce the 
exact value of gcd(a, b). 

Example 5.4.5 Let n and n + 1 be two consecutive positive integers. Then 

n ■ (-1) + (n + 1) ■ 1 = 1 

implies that 1 is a multiple of the greatest common divisor of n and n + 1. This means the 
greatest common divisor must be 1. Therefore, gcd(n,n+ 1) = 1 for all integers n. A 

Definition. Two integers a and b are said to be relatively prime if gcd(a, b) = 1. Therefore, 
a and b are relatively prime if they have no common divisors except ±1. <0> 

Example 5.4.6 Prove that if gcd(a, b ) = 1, then gcd (a + 6, a — b) equals to 1 or 2. 

Solution: From the linear combinations 

A linear combination 
is only a multiple of 
the gcd. 

we know that gcd(o + 6, a — b) divides both 2 a and 2b. Since gcd (a, 6) — 1, we conclude that 
gcd(a + b, a — b) divides 2. Consequently, gcd(a + 6, a — b) is either 1 or 2. A 

Example 5.4.7 Show that if gcd(a, 6) = 1, then gcd(2a + b, a + 2b) equals to either 1 or 3. 


(a + b) • 1 + (a — b) ■ 1 = 2a, 

(a + b) • 1 + (a — b) • (— 1) = 26, 
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Solution: From the linear combinations 

(2a + b) ■ 2 + (a + 2b) • (— 1) = 3a, 

(2a + b) • (—1) + (a + 26) • 2 = 3b, 

we know that gcd(2a + b, a + 2b) divides both 3a and 3b. Since gcd(a, b) = 1, we conclude that 
gcd(2a + b, a + 2b) divides 3. Thus, gcd(a + b, a — b) is 1 or 3. A 


Hands-On Exercise 5.4.7 What are the possible values of gcd(5m + In, 7m + 5 n) if the two 
positive integers m and n are relatively prime? 


A 


Example 5.4.8 Find the integers s and t such that 6 = gcd(426, 246) = 246s + 426f. 


Solution: Earlier, we studied how to find gcd(426, 246) = 6. In each division, we want to 
express the remainder as a linear combination of 246 and 426. This is how the computation 
proceeds: 


426 = 246 • 1 + 180, 

180 = 246- (-1) +426- 1 

246 = 180 • 1 + 66, 

66 = 246-1 + 180- (-1) 

= 246- 1 + [246- (-1) +426- 1] ■ (-1) 

= 246-2 + 426- (-1) 

180 = 66 • 2 + 48, 

48 = 180 • 1 + 66 • (-2) 

= [246 • (-1) + 426 • 1] • 1 + [246 • 2 + 426 • (-1)] • (-2) 
= 246(— 5) + 426 • 3 

66 = 48 • 1 + 18, 

18 = 66- 1 + 48- (-1) 

= [246 • 2 + 426 • (-1)] • 1 + [246(-5) + 426 • 3] • (-1) 

= 246 • 7 + 426 • (-4) 

48 = 18-2 + 12, 

12 = 48- 1 + 18- (-2) 

= [246(— 5) + 426 • 3] • 1 + [246 • 7 + 426 • (-4)] • (-2) 

= 246 -(-19) +426 -11 

18 = 12- 1 + 6, 

6 = 18- 1 + 12- (-1) 


= [246 • 7 + 426 • (-4)] • 1 + [246 • (-19) + 426 • 11] • (-1) 
= 246-26 + 426- (-15) 


The answer is 6 = 246 • 26 + 426 • (—15). 


A 


The computation is tedious! The extended Euclidean algorithm provides a relief. It 
keeps track of two sequences of integers s k and t k alongside with r k , such that 


r k = as k + bt k - 


This expresses every remainder as a linear combination of a and b. Since the last nonzero 
remainder is gcd (a,b), the corresponding linear combination will be the answer we are looking 
for. 
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The values of s k and t k for the last example are summarized below: 


k 

Tk 

s k 

t k 

2 

180 

-1 

1 

3 

60 

2 

-1 

4 

48 

-5 

3 

5 

18 

7 

-4 

6 

12 

-19 

11 

7 

6 

26 

-15 


The main issue is: how can we compute these values efficiently? 

The table above starts with k = 2. How about k = 0 and k = 11 From 

b = r 0 = as 0 + bt 0 , 

we determine that so = 0 and to = 1- Similarly, 

a = r\ = as i + bti 

implies that si = 1 and t\ = 0. Hence, the list of values of Sk and t k start with the following: 

k Sk tk 

0 0 1 

1 1 0 

In general, before we carry out the division rk-i = Tk, we should have already generated so 
through Sk, and to through tk- After the division, we obtain qk and Vk+i as in 

r k -i = r k q k +r k + 1 - 

Next, we compute and t k +i before moving on to the next division. We find 

^k -\- 1 = Tk—1 T'kQk 

= (as k -i + bt k -i) - (as k + bt k )q k 
= a(s fc _ i - s k q k ) + b(t k - i - t k q k ). 

Therefore, we need 

Sfc+i = s k - i — s k q k , 
lk +1 = t'k —1 t k q k . 

In words: 

next s-value = previous-previous s-value — previous s-value x corresponding q 1 
next t - value = previous-previous t - value — previous t - value x corresponding q. 

For example, assume at a certain stage, the values of s, t, and q are as follow: 

k s k t k q k 

0 0 1 
110 1 
2-1 11 
3 2-12 

1 
2 
1 
2 
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Then 


next s-value = — 1 — 2 • 2 = —5, 
next f-value = 1 — (—1) -2 = 3. 

Now the list becomes 

k Sk tk qk 

0 0 1 

110 1 
2-1 11 
3 2-12 

4-5 3 1 

2 
1 
2 

The entire computation can be carried out in a modified two-column format. 


Example 5.4.9 Find integers s and t such that gcd(246,426) = 246s + 426f. 


Solution: First, copy the quotients from the right-most column and insert them between those 
quotients in the left-most column: 


1 

426 

246 

1 1 

426 

246 


246 

180 

1 

246 

180 

2 

180 

66 

1 2 

180 

66 


132 

48 

1 

132 

48 

2 

48 

18 

1 becomes 2 

48 

18 


36 

12 

1 

36 

12 

2 

12 

6 

2 

12 

6 


12 



12 



0 



0 



Next, compute Sk and tk alongside these quotients (we do not need to record the values of k): 


Sk 

tk 

qk 



0 

1 




1 

0 

l 

426 

246 

-1 

1 

l 

246 

180 

2 

-1 

2 

180 

66 

-5 

3 

1 

132 

48 

7 

-4 

2 

48 

18 

-19 

11 

1 

36 

12 

26 

-15 

2 

12 

6 




12 





0 



The last nonzero remainder is the greatest common divisor, and the last linear combination 
gives the desired answer. We find gcd(246, 426) = 6 = 26 • 246 — 15 • 426. A 


Observe that, starting with k = 2, the signs of Sk and tk alternate. This provides a quick 
check of their signs. In addition, the signs of Sk and tk are opposite for each k > 2. 
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Hands-On Exercise 5.4.8 Use the two-column format to find the linear combination that 
produces gcd(153, 732). 


A 

Hands-On Exercise 5.4.9 Use the two-column format to find the linear combination that 
produces gcd(2478, 6958). 


A 


Summary and Review 

• The greatest common divisor of two integers, not both zero, is the largest (hence it must 
be positive) integer that divides both. 

• Use Euclidean algorithm to find the greatest common divisor. It can be implemented in a 
two-column format. 

• Using an extended version with two additional columns for computing s& and A, we can 
find the special linear combination of two integers that produces their greatest common 
divisor. 

• In general, a linear combination of two integers only gives a multiple of their greatest 
common divisor. 

Exercises 5.4 

1. For each of the following pairs of integers, find the linear combination that equals to their 
greatest common divisor. 

(a) 27, 81 (b) 24, 84 (c) 1380, 3020 

2. For each of the following pairs of integers, find the linear combination that equals to their 
greatest common divisor. 

(a) 120, 615 (b) 412, 936 (c) 1122, 3672 

3. What are the possible values of gcd(2a + 56, 5 a — 26) if the two positive integers a and 6 
are relatively prime? 
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4. Prove that any consecutive odd positive integers are relatively prime. 

5. Let m and n be positive integers. Prove that gcd(m, m + n) \ n. 

6. Let a and b be integers such that 1 < a < b and gcd(a, b) = 1. Prove that gcd(a+6, ab) = 1. 

7. What are the possible values of gcd(3m — 5 n, 5 m + 3 n) if the two positive integers m and 
n are relatively prime? 

8. What are the possible values of gcd(4p + 7 q, 7 p — 4 q) if the two positive integers p and q 
are relatively prime? 

5.5 More on GCD 

In this section, we shall discuss a few technical results about gcd(o, b). 

Theorem 5.5.1 Let d = gcd(a, b), where a, b £ N. Then 

{as + bt | s, t £ Z} = {nd \ n G Z}. 

Hence, every linear combination of a and b is a multiple o/ gcd(a, b), and vice versa, every 
multiple of gcd(a,6) is expressible as a linear combination of a and b. 

Proof: For brevity, let 

S = {as + bt | s, t £ Z}, and T = {nd \ n £ Z}. 

We shall show that S = T by proving that S C T and T C S. 

Let x £ S. To prove that S C T, we want to show that x £ T as well. Being in S means 
x = as + bt for some integers s and t. Since d = gcd(a, b), we know that d \ a and d \ b. Hence, 
a = da' and b = db' for some integers a' and b' . Then 

x = as + bt = da's + db't = d(a' s + b't), 

where a' s + b't is an integer. This shows that a; is a multiple of d. Hence, x £ T. 

To show that T C S, it suffices to show that d £ S. The reason is, if d = as + bt for some 
integers s and t, then nd = n(as + bt) = a(ns) + b(nt) implies that nd € S. 

To prove that d £ S, consider S + . Since a = a ■ 1 + b ■ 0, we have a £ S + . Hence, S + is a 
nonempty set of positive integers. The principle of well-ordering implies that S + has a smallest 
element. Call it e. Then 

e = as* + bt* 

for some integers s* and t* . We already know that a £ S + . Being the smallest element in S + , 
we must have e < a. Then a = eq + r for some integers q and r, where 0 < r < e. If r > 0, then 

r = a — eq = a— (as* + bt*)q = a(l — s*q) + b(—t*q). 

This makes r a linear combination of a and b. Since r > 0, we find r £ S + . Since r < e would 
contradict the minimality of e, we must have r = 0. Consequently, a = eq, thus e | a. Similarly, 
since b = a- 0 + b- l£S + ,we can apply the same argument to show that e | b. We conclude 
that e is a common divisor of a and b. 

Let / be any common divisor of a and b. Then / | a and / | b. It follows that / | (ax + by) 
for any integers x and y. In particular, / | (as* + bt*) = e. Hence, / < e. Since e is itself a 
common divisor of a and b, and we have just proved that e is larger than any other common 
divisor of a and 6, the integer e itself must be the greatest common divisor. It follows that 
d = gcd(a, b) = e £ S + . The proof is now complete. ■ 


Name the two sets, 
and give a brief 
outline of the proof. 


Part 1: S C T. 


Part 2: T C S. First, 
reduce it to a simpler 
problem. 

PWO states that 
S + has a smallest 
element e. 


This e is a common 
divisor of a and b. 


But e is larger than 
the other common 
divisors, hence, it 
must be the gcd. 
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Corollary 5.5.2 The greatest common divisor of two nonzero integers a and b is the smallest 
positive integer among all their linear combinations. In other words, gcd(a, b) is the smallest 
positive element in the set {as + bt \ s, t £ Z}. 

Corollary 5.5.3 For any nonzero integers a and b, there exist integers s and t such that 
gcd(a, b) = as + bt. 

Proof: Theorem 5.5.1 maintains that the set of all the linear combinations of a and b equals 
to the set of all the multiples of gcd(a, b ). Since gcd(a, b ) is a multiple of itself, it must equal to 
one of those linear combinations. Thus, gcd(a, b) = sa + tb for some integers s and t. ■ 

Theorem 5.5.4 Two nonzero integers a and b are relatively prime if and only if as + bt = 1 
for some integers s and t. 

Proof: The result is a direct consequence of the definition that a and b are said to be relatively 
prime if gcd(a, b) = 1. ■ 

Example 5.5.1 It is clear that 5 and 7 are relatively prime, so are 14 and 27. Find the linear 
combination of these two pairs of numbers that equals to 1. 

Solution: By inspection, or using the extended Euclidean algorithm, we find 3 • 5 — 2 • 7 = 1, 
and 2- 14- 1 • 27 = 1. A 

Hands-On Exercise 5.5.1 Show that gcd(133, 143) = 1 by finding an appropriate linear com- 
bination. 


A 

Hands-On Exercise 5.5.2 Show that 757 and 1215 are relatively prime by finding an appro- 
priate linear combination. 


A 

Example 5.5.2 It follows from 

(-1) • n + 1 • (n + 1) = 1 

that gcd(n,n + 1) = 1. Thus, any pair of consecutive positive integers is relatively prime. A 

Theorem 5.5.5 (Euclid’s Lemma) Let a,b,c€ Z. //gcd(a,c) = 1 and c \ ab, then c \ b. 

Discussion: Let us write down what we know and what we want to show (WTS): 

Know : as + ct = 1 for some integers s and t, 
ab = cx for some integer x, 
b = cq for some integer q. 


WTS : 
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To be able to show that b = cq for some integer q , we have to come up with some information 
about b. This information must come from the two equations as + ct = 1 and ab = cx. Since 
6 = 6- 1, we can multiply b to both sides of as + ct = 1. By convention, we cannot write 

(as + ct = 1) • b. 

This notation is unacceptable! The reason is: we cannot multiply an equation by a number. 
Rather, we have to multiply both sides of an equation by the number: 

b = 1 • b = (as + ct) ■ b = asb + ctb. 

Obviously, ctb is a multiple of c; we are one step closer to our goal. Since asb = ab • s, and we 
do know that ab is indeed a multiple of c, so the proof can be completed. We are now ready to 
tie up the loose ends, and polish up the proof. <0> 

Proof: Assume gcd(a,c) = 1, and c | ab. There exist integers s and t such that 


as + ct = 1 . 


This leads to 


6=1-6 = (as + ct) • b = asb + ctb. 

Since c | ab, there exists an integer x such that ab = cx. Then 

6 = ab ■ s + ctb = cx ■ s + ctb = c(xs + tb), 

where xc + tb € Z. Therefore, c | 6. ■ 


Corollary 5.5.6 If a,b £ Z and p is a prime such that p \ ab, then either p \ a or p\b. 

Proof If p | a, we are done with the proof. If p \ a, then gcd(p, a) = 1, and Euclid’s lemma 
implies that p | 6. ■ 


We cannot apply the corollary if p is composite. For instance, 6 | 4 • 15, but 6 j 4 and 6 \ 15. 
On the other hand, when p \ ab, where p is a prime, it is possible to have both p \ a and p | 6. 
For instance, 5 | 15 • 25, yet we have both 5 | 15 and 5 | 25. 

Corollary 5.5.7 If a±, a %, . . . , a n £Z and p is a prime such that p \ a\a 2 • ■ ■ a n , then p \ ai for 
some i, where 1 < i < n. Consequently, if a prime p divides a product of n factors, then p must 
divide at least one of these n factors. 

Proof We leave the proof to you as an exercise. ■ 

Example 5.5.3 Prove that \/2 is irrational. 


Remark. We proved previously that \/2 is irrational in a hands-on exercise. The solution we 
presented has a minor flaw. A key step in that proof claims that 

The integer 2 divides to 2 , therefore 2 divides to. 

This claim is false in general. For example, 4 divides 6 , but 4 does not divide 6. Therefore, we 
have to justify why this claim is valid for 2. <£> 

Solution: Suppose \/2 is rational, then we can write 

TO 

n 


WRONG notation 
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To prove a 
biconditional 
statement, we need to 
prove both necessity 
and sufficiency. 


for some positive integers to and n that do not share any common divisor except 1. Squaring 
both sides and cross- multiplying gives 


2 n 2 = to 2 . 

Thus 2 divides to 2 . Since 2 is prime, Euclid’s lemma implies that 2 must also divide to. Then 
we can write to = 2s for some integer s. The equation above becomes 

2 n 2 = to 2 = (2s) 2 = 4s 2 . 


Hence, 


2s 


2 


which implies that 2 divides n 2 . Again, since 2 is prime, Euclid’s lemma implies that 2 also 
divides n. We have proved that both to and n are divisible by 2. This contradicts the assumption 
that m and n do not share any common divisor. Hence, y/2 must be irrational. ▲ 


Hands-On Exercise 5.5.3 


Prove that 


\fl is irrational. 


A 

We close this section with a truly fascinating result. 

Theorem 5.5.8 For any positive integers m and n, gcd (F m ,F n ) = -F gc d(m,n)- 
Corollary 5.5.9 For any positive integer n, 3 | F n <*=> 4 | n. 

Proof: (=>) If 3 | F n , then, because F 3 = 4, we have 

3 — gcd(3, E n ) = gcd ( F,[ , F n ) = F gcd ^ 4 ,n)* 

It follows that gcd(4, n) = 4, which in turn implies that 4 | n. 

(<=) If 4 | n, then gcd(4, n) = 4, and 

gcd(3, F n ) = gcd (F 4 ,F n ) = F gcd(4jrl) = F 4 = 3; 

therefore, 3 | F n . ■ 
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Summary and Review 

• Given any two nonzero integers, there is only one special linear combination that would 
equal to their greatest common divisor. 

• All other linear combinations are only multiples of their greatest common divisor. 

• If a and c are relatively prime, then Euclid’s lemma asserts that if c divides ab , then c 
must divide b. 

• In particular, if p is prime, and if p \ ab, then either p \ a or p | b. 


Exercises 5.5 

1. Given any arbitrary positive integer n, prove that 2 n + 1 and 3 n + 2 are relatively prime. 

2. Use induction to prove that for any integer n > 2, if «i, 02 , . . . , a n £ Z and p is a prime 
such that p | aia 2 • • • a n , then p \ at for some i, where 1 < i < n. 

3. Prove that yjp is irrational for any prime number p. 

4. Prove that \f2 is irrational. 

5. Given any arbitrary positive integers a, b, and c, show that if a | c, b | c, and gcd(a, b) = 1, 
then ab \ c. 

Remark. This result is very important. Remember it! 

6. Given any arbitrary positive integers a, b, and c, show that if a \ c, and b \ c, then ab \ cd, 
where d = gcd(a, b). 

7. Use induction to prove that 3 j (2 4n — 1) and 5 | (2 4n — 1) for any integer n > 1. Use these 
results to prove that 15 | (2 4 " — 1) for any integer n > 1. 

8. Prove that 2 | F n <=> 3 | n for any positive integer n. 


5.6 Fundamental Theorem of Arithmetic 

Primes are positive integers that do not have any proper divisor except 1. Primes can be 
regarded as the building blocks of all integers with respect to multiplication. 

Theorem 5.6.1 (Fundamental Theorem of Arithmetic) Given any integer n > 2, there 
exist primes p\ < P 2 < • • • < p s such that n = P 1 P 2 ■ ■ ■ Ps ■ Furthermore, this factorization is 
unique, in the sense that if n = qiq -2 . . .qt for some primes qi < q 2 < • • • < qt, then s = t and 
Pi = Qi f or each i, 1 <i < s. 

Proof: We first prove the existence of the factorization. Let S be the set of integers n > 2 that 
are not expressible as the product of primes. Since a product may contain as little as just one 
prime, S does not contain any prime. Suppose S ^ 0, then the principle of well-ordering implies 
that S has a smallest element d. Since S does not contain any prime, d is composite, so d = xy 
for some integers x and y , where 2 < x, y < d. The minimality of d implies that x, y ^ S. So 
both x and y can be expressed as products of primes, then d = xy is also a product of primes, 
which is a contradiction, because d belongs to S. Therefore, <5 = 0, which means every integer 
n > 2 can be expressed as a product of primes. 

Next, we prove that the factorization is unique. Assume there are two ways to factor n, say 
n = P 1 P 2 ■ ■ - Ps — Q 1 Q 2 ■ ■ - Qt- Without loss of generality, we may assume s < t. Suppose there 
exists a smallest i, where 1 < i < s, such that pt ^ Qi- Then 


Pi = Qi, P2 = q2, ■■■ Pi-i = qi- 1 , but p t £ q,. 
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It follows that 


PiPi+i ■■■p s = qiqi + 1 ■■■qt, 


in which both sides have at least two factors (why?). Without loss of generality, we may assume 
Pi < q.i. Since pi \ q^i+i ■ ■ ■ qt, and Pi is prime, Euclid’s lemma implies that Pi \ qj for some j, 
where i < j < t. Since qj is prime, we must have pi = qj > qt, which contradicts the assumption 
that pi < qi . Therefore, there does not exist any i for which pi ^ qi. This means pi = qt for 
each i, and as a consequence, we must have have s = t. ■ 


Interestingly, we can use the strong form of induction to prove the existence part of the 
Fundamental Theorem of Arithmetic. 


Proof 2: (Existence) Induct on n. The claim obviously holds for n = 2. Assume it holds for 
n = 2, 3, . . . , k for some integer k > 2. We want to show that it also holds for k + 1. If k + 1 is 
a prime, we are done. Otherwise, k + 1 = a/3 for some integers a and f3, both less than k + 1. 
Since 2 < a, /3 < k, both a and (3 can be expressed as a product of primes. Putting these primes 
together, and relabeling and rearranging them if necessary, we see that k + 1 is also expressible 
as a product of primes in the form we desire. This completes the induction. ■ 

The next result is one of the oldest theorems in mathematics, numerous proofs can be found 
in the literature. 

Theorem 5.6.2 There are infinitely many primes. 

Proof: Suppose there are only a finite number of primes pi,P 2 , ■ ■ ■ ,Pn- Consider the integer 


x = I+P1P2 ■■■Pn- 


It is obvious that x pi for any i. Since P\,P2, ■ ■ ■ ,Pn are assumed to be the only primes, the 
integer x must be composite, hence can be factored into a product of primes. Let pk be one of 
these prime factors, so that x = puq for some integer q. Then 

1 = X - P1P2 ■■■Pn 

= Pkq -P1P2 ■ ■ -Pn 
= Pk(q-PlP2---Pk~lPk+l---Pn), 

which is impossible. This contradiction proves that there are infinitely many primes. ■ 

Some of the primes listed in the Fundamental Theorem of Arithmetic can be identical. If we 
group the identical primes together, we obtain the canonical factorization or prime-power 
factorization of an integer. 

Theorem 5.6.3 All integers n > 2 can be uniquely expressed in the form n = p e fip e fi ■ • • p for 
some distinct primes pi and positive integers e, . 

Once we find the prime-power factorization of two integers, their greatest common divisor 
can be obtained easily. 

Example 5.6.1 From the factorizations 246 = 2-3-41 and 426 = 2 • 3 • 79, it is clear that 
gcd(246, 426) = 2-3 = 6. A 

Hands-On Exercise 5.6.1 Find the factorizations of 153 and 732, and use them to compute 
gcd(153, 732). 


A 
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Although the set of primes that divide two different positive integers a and b may be different, 
we could nevertheless write both a and b as the product of powers of all the primes involved. 
For example, by combining the prime factors of 

12300 = 2 2 • 3 • 5 2 • 41, and 34128 = 2 4 • 3 3 • 79, 

we could write them as 

12300 = 2 2 • 3 1 • 5 2 • 41 1 • 79°, and 34128 = 2 4 • 3 3 • 5° • 41° • 79 1 . 

It follows that 

gcd(12300, 34128) = 2 2 • 3 1 • 5° • 41° • 79° = 12. 

The generalization is immediate. 

Theorem 5.6.4 If a = p^p^ 2 ■ • • Pt* and b = p^p^ 2 ■ ■ ■ p{* for some distinct primes pi, where 
ej, fi > 0 for each i, then gcd(a, b) = p™ m ( ei >/i)pmm(e 2 ,/ 2 ) . . . . 

In this theorem, we allow the exponents to be zero. In the usual prime-power factorization, 
the exponents have to be positive. 

Hands-On Exercise 5.6.2 Compute gcd(2 3 • 5 • 7 • 1 1“ , 2 2 • 3 2 • 5 2 • 7 2 ). 


A 

Definition. The least common multiple of the integers a and 6, denoted lcm(a, 6), is the 
smallest positive common multiple of both a and b. <0> 

Theorem 5.6.5 If a = pffp e ff • ■ ■ p\ L and b = p^p^ 2 • • • p{ * for some distinct primes pi, where 
ei,fi > 0 for each i, then lcm(a,6) = p™ a '^ ei '^ p ™ ax ( e2 >A) . . .p™ ax ( e *>A)_ 

Hands-On Exercise 5.6.3 Compute lcm(2 3 • 5 • 7 • 1 1 2 , 2 2 • 3 2 • 5 2 • 7 2 ). 


A 

Corollary 5.6.6 For any positive integers a and b, we have ab = gcd(a, b) ■ lcm(a, b). 

Proof: For each i. one of the two numbers e,; and /, is the minimum, and the other is the 
maximum. Hence, 

e» + fi = min (e,, /*) + max(e, ; , /*), 

from which we obtain 

e; fi _ ei+fi _ min(e i ,/i)+max(/,,/ i ) _ min(ei,/i) max(ei,/i) 

Pi Pi Pi Pi Pi 

Therefore, ab equals the product of gcd(a, b) and lcm(a, b). ■ 

Example 5.6.2 Since 12300 = 2 2 • 3 1 • 5 2 • 41 1 • 79°, and 34128 = 2 4 • 3 3 • 5° • 41° • 79\ it follows 
that 

lcm(12300, 34128) = 2 4 • 3 3 • 5 2 • 41 1 • 79 1 = 34981200. 

We have seen that gcd(12300, 34128) = 12, and we do have 12 • 34981200 = 12300 • 34128. A 
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Hands-On Exercise 5.6.4 Knowing that gcd(246, 426) = 6, how would you compute the value 
of lcm(246, 426)? 


A 


Example 5.6.3 When we add two fractions, we first take the common denominator, as in 

7 5 _ 7 3 5 2 21 + 10 31 

8 + 12~8'3 + 12'2~ 24 “ 24' 

Clear enough, the least common denominator is precisely the least common multiple of the two 
denominators. ▲ 

Example 5.6.4 The control panel of a machine has two signal lights, one red and one blue. 
The red light blinks once every 10 seconds, and the blue light blinks once every 14 seconds. 
When the machine is turned on, both lights blink simultaneously. After how many seconds will 
they blink at the same time again? 

Solution: This problem illustrates a typical application of least common multiple. The red 
light blinks at 10, 20, 30, . . . seconds, while the blue light blinks at 14, 28, 42, . . . seconds. In 
general, the red light blinks at t seconds if t is a multiple of 10, and the blue light blinks when 
t is a multiple of 14. Therefore, both lights blink together when t is a multiple of both 10 and 
14. The next time it happens will be lcm(10, 14) = 70 seconds later. ▲ 

Hands-On Exercise 5.6.5 Two comets travel on fixed orbits around the earth. One of them 
returns to Earth every 35 years, the other every 42 years. If they both appear in 2012, when is 
the next time they will return to Earth in the same year? 


A 

Hands-On Exercise 5.6.6 Given relatively prime positive integers m and n, what are the 
possible values of lcm(4m — 6n, 6m + 4n)? 


A 


Example 5.6.5 What does 2Z n 3Z equal to? 

Solution: Assume x £ 2Z (~l 3Z, then x £ 2Z and x £ 3Z. This means a; is a multiple of 
both 2 and 3. Consequently, a: is a multiple of lcm(2, 3) = 6, which means x £ 6Z. Therefore, 
2Z n 3Z C 6Z. 
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Next, assume x G 6Z, then a; is a multiple of 6. Consequently, a: is a multiple of 2, as well 
as a multiple of 3. This means a; € 2Z, and x G 3Z. As a result, x € 2Z fl 3Z. Therefore, 
6Z C 2Z fl 3Z. Together with 2Z fl 3Z C 6Z, we conclude that 2Z n 3Z = 6Z. A 

Hands-On Exercise 5.6.7 What does 4Z n 6Z equal to? 


A 


Summary and Review 

• There are infinitely many primes. 

• Any positive integer n > 1 can be uniquely factored into a product of prime powers. 

• Primes can be considered as the building blocks (through multiplication) of all positive 
integers exceeding one. 

• Given two positive integers a and 6, their least common multiple is denoted as lcm(a, 6). 

• For any positive integers a and b , we have ab = gcd(a, b) • lcm(a, b). 

Exercises 5.6 

1. Find the prime-power factorization of these integers. 

(a) 4725 (b) 9702 

(c) 180625 (d) 1662405 

2. Find the least common multiple of each of the following pairs of integers. 

(a) 27, 81 (b) 24, 84 (c) 120, 615 

(d) 412, 936 (e) 1380, 3020 (f) 1122, 3672 

3. Richard follows a very rigid routine. He orders a pizza for lunch every 10 days, and has 
dinner with his parents every 25 days. If he orders a pizza for lunch and has dinner with 
his parents today, when will he do both on the same day again? 

4. Compute gcd(15 • 50, 25 • 21), and lcm(15 • 50, 25 • 21). 

5. What does 10Z n 15Z equal to? Prove your claim. 

6. Let m and n be positive integers. What does mZ D nZ equal to? Prove your claim. 

7. Let p be an odd prime. Show that 

(a) p is of the form 4/c + 1 or of the form 4 k + 3 for some nonnegative integer k. 

(b) p is of the form 67c + 1 or of the form 6 k + 5 for some nonnegative integer k. 

8. Give three examples of an odd prime p of each of the following forms 


(a) 4/c + 1 
(c) 6/c + 1 


(b) 4/c + 3 
(d) 6/c + 5 
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9. Prove that any prime of the form 3n + 1 is also of the form 6k + 1. 

10. Prove that if a positive integer n is of the form 3 k + 2, then it has a prime factor of the 

same form. 

Hint: Consider its contrapositive. 

11. Prove that 5 is the only prime of the form n 2 — 4. 

Hint : Consider the factorization of n 2 — 4. 

12. Use the result “Any odd prime p is of the form 6k + 1 or of the form 6k + 5 for some 
nonnegative integer k" to prove the following results. 

(a) If p > 5 is a prime, then p 2 + 2 is composite. 

(b) If p > q > 5 are primes, then 24 | (p 2 — q 2 ). 

5.7 Modular Arithmetic 

Modular arithmetic uses only a fixed number of possible results in all its computation. For 
instance, there are only 12 hours on the face of a clock. If the time now is 7 o’clock, 20 hours later 
will be 3 o’clock; and we do not say 27 o’clock! This example explains why modular arithmetic 
is referred to by some as clock arithmetic. 

Example 5.7.1 Assume the current time is 2:00 P.M. Write this as 14:00. Sixty five hours 
later, it would be 79:00. Since 

79 = 24 • 3 + 7, 

it will be 7:00 or 7 A.M. A 

Hands-On Exercise 5.7.1 Designate Sunday, Monday, Tuesday, . . . , Saturday as Day 0, 1, 
2, . . . , 6. If today is Monday, then it is Day 1. What day of the week will it be two years from 
now? Assume there are 365 days in a year. 


A 

In the clock example, we essentially regard 27 o’clock the same as 3 o’clock. They key is, we 
are only interested in the remainder when a value is divided by 12. 

mi = m 2 (mod n) Definition. Let n > 2 be a fixed integer. We say the two integers mi and m 2 are congruent 

4=> n | (mi — m 2 ). modulo n , denoted 

mi = m -2 (mod n) 

if and only if n | (mi — m 2 ). The integer n is called the modulus of the congruence. 

What does this notion of congruence have to do with remainders? The next result describes 
their connection. 

Theorem 5.7.1 Let n > 2 be a fixed integer. For any two integers m\ and m 2 , 
mi = m 2 (mod n) -O- mi mod n = m 2 mod n. 

Remark. Do not confuse the two notations. The notation “(mod n)” after mi = m 2 indicates 
a congruence relation, in which “mod n” are enclosed by a pair of parentheses, and the notation 
is placed at the end of the congruence. In contrast, the “mod” between mi and n, without 
parentheses, is a binary operation that yields the remainder when mi is divided by n. <0> 
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Proof: (=>) Assume ?Bi = m2 (mod n). The definition of congruence implies that we have 
n | (mi — m2)- Hence, 

mi — m2 = nq 

for some integer q. Let mi = nqi + r\ and m 2 = 77.92 + ^2 for some integers qi,q 2 ,fi,r 2 , such 
that 0 < n,r 2 < n. Then 

nq = mi — m 2 = n{q\ — q 2 ) + r\ — r 2 . 

Since ri — r 2 = n{q — 91 + 92 ), we conclude that n \ r\ — t^. However, 0 < r\,r 2 < n 
implies that |rr — r 2 \ < n. Therefore, we must have rr — r 2 = 0, or 7*1 = 7 * 2 - It follows that 
mi mod n = m 2 mod n. 

(-4=) Assume mi mod ti = m 2 mod n. According to the Division Algorithm, the remainder in 
an integer division is unique. Thus, m-\ = nqi + r and m 2 = nq 2 + r for some integers qi,q 2 ,r 
such that 0 < r < n. Then 

mi - m 2 = {nqi + r) - (■ nq 2 + r) = 77.(91 - q 2 ). 

Therefore, n | (mi — m 2 ). ■ 

Corollary 5.7.2 Let n > 2 be a fixed integer. Then 

a = 0 (mod n) n \ a. 

Theorem 5.7.1 tells us mi = m 2 (mod n) if and only if mi and m 2 share the same remainder 
when they are divided by n. Given any integer m, 

m mod n £ {0, 1, 2, . . . , n — 1}. 

We call these values the residues modulo n. In modular arithmetic, when we say “ reduced 
modulo n,” we mean whatever result we obtain, we divide it by n, and report only the smallest 
possible nonnegative residue. 

The next theorem is fundamental to modular arithmetic. 

Theorem 5.7.3 Let n > 2 be a fixed integer. If a = b (mod n) and c = d (mod n), then 

a + c = b + d (mod n) , 

ac = bd (mod n ). 

Proof: Assume a = b (mod n) and c = d (mod n ) . Then 71 | (a — b) and n \ (c — d). We can 
write 

a — b = ns, and c — d = nt 

for some integers s and t. Consequently, 

(a + c) — {b + d) = {a — b) + (c — d) = ns + nt = n{s + t), 
where s + t is an integer. This proves that a + c = b + d (mod n). We also have 

ac — bd = {b + ns)(d + nt) — bd = but + nsd + n 2 st = n{bt + sd + nst), 
where bt, + sd + nst is an integer. Thus, n | {ac — bd), which means ac = bd (mod n). ■ 

Because of Theorem 5.7.3, we can add or multiply an integer to both sides of a congruence 

without altering the congruences. 

Example 5.7.2 We can use subtraction to reduce 2370 modulo 11. Any multiple of 11 is 
congruent to 0 modulo 11. So we have, for example, 

2370 = 2370 (mod 11), and 0 = -2200 (mod 11). 


Recall the definition 
of congruence. 


The final answer in 
modular arithmetic 
is always between 0 
and n — 1, inclusive. 
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Use subtraction to 
reduce m modulo n. 


In modular 
arithmetic, the final 
answer is always 
between 0 and n — 1, 
inclusive. 


We can use negative 
values in the 
intermediate steps. 


Applying Theorem 5.7.3, we obtain 

2370 = 2370 - 2200 = 170 (mod 11). 

What this means is: we can keep subtracting appropriate multiples of n from m until the answer 
is between 0 and n — 1, inclusive. It does not matter which multiple of 11 you use. The point 
is, pick one that you can think of quickly, and keep repeating the process. Continuing in this 
fashion, we find 

170 = 170 - 110 = 60 (mod 11). 

Since 60 — 55 = 5, we determine that 2370 = 5 (mod 11). A 

Hands-On Exercise 5.7.2 Reduce 12457 to the smallest nonnegative residue modulo 17. 


A 

Example 5.7.3 In a similar manner, if to is negative, we can keep adding multiples of n to it 
until the answer is positive. For example, 

—278 = —278 + 300 = 52 (mod 11). 

it is obvious that 52 = 52 — 44 = 8 (mod 11). Thus, —278 = 8 (mod 11). A 

Hands-On Exercise 5.7.3 Evaluate —3275 mod 11. This is the same as reducing —3275 to 
the smallest nonnegative residue modulo 11. 


A 


In a complicated computation, reduce the result from each intermediate step before you 
carry on with the next step. This will simplify the computation tremendously. To further speed 
up the computation, we can use negative values in the intermediate step. Nonetheless, the final 
answer must be between 0 and n — 1. 


Example 5.7.4 Reduce 37 2 • 41 — 53 • 2 modulo 7. 
Solution: Take note that 


37 = 37- 35 = 

2 

(mod 7), 

41 = 41-42 = 

-1 

(mod 7), 

53 = 53-49 = 

4 

(mod 7). 


Therefore, 

37 2 • 41 - 53 • 2 = 2 2 • (-1) - 4 • 2 = -12 (mod 7). 
We determine that 37 2 • 41 — 53 • 2 = 2 (mod 7). 

Hands-On Exercise 5.7.4 Evaluate 56 3 • 22 • 17 — 35 • 481 (mod 9). 


A 


A 
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Tedious computation may become rather simple under modular arithmetic. 

Example 5.7.5 Show that if an integer n is not divisible by 3, then n 2 — 1 is always divisible 
by 3. Equivalently, show that if an integer n is not divisible by 3, then n 2 — 1 = 0 (mod 3). 

Solution 1: Let n be an integer not divisible by 3, then either n = 1 (mod 3), or n = 2 
(mod 3). 

• Case 1. If n = 1 (mod 3), then 

n 2 — 1 = l 2 — 1 = 0 (mod 3). 

• Case 2. If n = 2 (mod 3) , then 

n 2 — 1 = 2 2 — 1 = 3 = 0 (mod 3). 

In both cases, we have found that n 2 — 1 is divisible by 3. ▲ 

Solution 2: Let n be an integer not divisible by 3, then either n = 1 (mod 3), or n = 2 
(mod 3). This is equivalent to saying n = ±1 (mod 3). Then 

n 2 — 1 = (±1) 2 — 1 = 1 — 1 = 0 (mod 3), 

which means n 2 — 1 is divisible by 3. ▲ 

Hands-On Exercise 5.7.5 Use modular arithmetic to show that 5 | (n 5 — n) for any integer 

n. 


A 

Hands-On Exercise 5.7.6 Use modular arithmetic to show that n(n + l)(2n + 1) is divisible 
by 6 for any integer n. 


A 

Raising an integer to a large power poses a serious problem. We cannot just raise an integer 
to a large power, because the result could be so large that the calculator or computer has to 
convert it into a decimal value and start using scientific notation to handle it. Consequently, 
the answer will not be accurate. 

A better solution is to reduce the intermediate results modulo n after each multiplication. 
This will produce an accurate result, but it will take a long time to finish if the power is huge. 


See Hands-On 
Exercise 3.2.4 and 
Example 3.5.1. 
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Write 29 as a sum of 
powers of 2. 


The method of 
repeated squaring. 


Fortunately, there is a much faster way to perform exponentiation that uses a lesser number of 
multiplications. 


Example 5.7.6 Evaluate 5 29 (mod 11). 

Solution: First, write the exponent 29 as a sum of powers of 2. We can do it by inspection. 
Start with the highest power of 2 that is less than or equal to 29, and then work with whatever 
is left in the sum: 

29 = 16 + 13 = 16 + 8 + 5 = 16 + 8 + 4 + 1. 

We are essentially expressing 29 in base 2. We can now write 

5 29 = 5 1 6 + 8 + 4+ 1 _gl6.g8.g4g 

These powers of 5 can be obtained by means of repeated squaring: 

5 1 = 5, 

5 2 = 5 2 , 

5 4 = (5 2 ) 2 , 

5 8 = (5 4 ) 2 , 

5 16 = (5 8 ) 2 , 


The iteration is simple: each new power is obtained by squaring the previous power. Since we 
are doing modular arithmetic, we want to reduce each intermediate result modulo 11: 


It follows that 


5 

5 


(mod 11) 

5 2 = 

25 

= 3 

(mod 11) 

5 4 = 

3 2 

= 9 = -2 

(mod 11) 

5 8 = 

9 2 

= (— 2) 2 = 4 

(mod 11) 

5 16 = 

4 2 

= 16 = 5 

(mod 11) 

5 29 = 5 16 

• 5 8 

• 5 4 • 5 = 5 • 4 • (—2) • 5 

(mod 11). 

find 5 29 = 

E 9 

(mod 11). 



▲ 


Hands-On Exercise 5.7.7 


Use repeated squaring to find 7 45 (mod 11). 


A 
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Hands-On Exercise 5.7.8 Use repeated squaring to evaluate 9 58 (mod 23). 


A 


In modular arithmetic, we are basically working with the remainders only. The set of integers 
{0, 1, 2, . . . , n — 1} is called the set of integers modulo n, and is denoted by Z n (pronounced 
as Z mod n). In addition, we define two new arithmetic operations on Z„. They are called 
“addition” and “multiplication” because they work like the usual addition and multiplication, 
except that we have to apply the mod operation to the results. To distinguish them from the 
usual addition and multiplication, we denote them by © and ©, and are called “circled plus” 
and “circled dot,” respectively. Formally, 

a © b = (a + b) mod n, and aQb = (a ■ b) mod n. 


The addition and multiplication tables for Z 6 are listed below. 


© 

0 

1 

2 

3 

4 

5 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

5 

2 

0 

2 

4 

0 

2 

4 

3 

0 

3 

0 

3 

0 

3 

4 

0 

4 

2 

0 

4 

2 

5 

0 

5 

4 

3 

2 

1 


© 

0 

1 

2 

3 

4 

5 

0 

0 

1 

2 

3 

4 

5 

1 

1 

2 

3 

4 

5 

0 

2 

2 

3 

4 

5 

0 

1 

3 

3 

4 

5 

0 

1 

2 

4 

4 

5 

0 

1 

2 

3 

5 

5 

0 

1 

2 

3 

4 


Compare them to the tables for Z 7 . 


© 

0 

1 

2 

3 

4 

5 

6 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

5 

6 

2 

0 

2 

4 

6 

1 

3 

5 

3 

0 

3 

6 

2 

5 

1 

4 

4 

0 

4 

1 

5 

2 

6 

3 

5 

0 

5 

3 

1 

6 

4 

2 

6 

0 

6 

5 

4 

3 

2 

1 


© 

0 

1 

2 

3 

4 

5 

6 

0 

0 

1 

2 

3 

4 

5 

6 

1 

1 

2 

3 

4 

5 

6 

0 

2 

2 

3 

4 

5 

6 

0 

1 

3 

3 

4 

5 

6 

0 

1 

2 

4 

4 

5 

6 

0 

1 

2 

3 

5 

5 

6 

0 

1 

2 

3 

4 

6 

6 

0 

1 

2 

3 

4 

5 


In both addition tables, all possible values appear in every row and every column. The same 
is true in the nonzero rows and nonzero columns in the multiplication table for Z 7 . However, 
some of the rows in the multiplication table for Zg do not contain all the integers in Zg. This 
suggests that the algebraic properties of Z„ depend on the value of n. 

In fact, whenever n is prime, the addition and multiplication tables of Z„ behave like the 
ones in Z 7 . It can be shown that when n is prime, Z„ has the following properties. 


Z„ is pronounced 
Z mod n. 
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1. Both ® and © are commutative, meaning 

a © b = b © a and a © b = b © a 

for all a,b £ Z n . 

2. Both © and © are associative , meaning that 

(a © b) © c = a © (b © c) and (a © b) © c = a © (6 © c) 

for all a,b,c £ Z n . 

3. The operations © and © satisfy the distributive laws 

aO (6 © c) = (a © 6) © (a © c) and (6 © c) © a = {b © a) © (c © a) 

for all a,b,c £ Z n . 

4. The integer 0 is the additive identity, meaning that a©0=0©a=a for all a £ Z n . 

5. For every a £ Z n , there exists a unique integer a' £ Z n such that a © a' = 0. This integer 
a' is called the additive inverse or negative of a, and is denoted by —a. 

6 . The integer 1 is the multiplicative identity, meaning that a © 1 = 1 © a = a for all 

a £ Z n . 

7. For every integer a £ Z* (hence, « / 0), there exists a unique nonzero integer a' £ Z n 
such that a © a' = 1. This integer a' is called the multiplicative inverse or reciprocal 
of a, and is denoted by a~ x . 

Example 5.7.7 From the tables above, only 1 and 5 have multiplicative inverses in Z 6 . In 
fact, 

11 = 1 and 5-5 = 1 in Z§ 

imply that l -1 = 1, and 5 _1 = 5 in Zq. On the other hand, every nonzero integer in Z7 has a 
multiplicative inverse: 

l -1 = 1, 2 -2 = 4, 3" 1 = 5, 4 _1 = 2, 5" 1 = 3, and 6" 1 = 6. 

Be sure to verify these inverses. A 

In general, given any set of numbers, we can define arithmetic operations in any way we like, 
provided that they obey certain rules. This produces an algebraic structure. For example, 
we call a set of elements S with two binary operations denoted © and 0 a field, and write 
(S, ©, ©} or (S, ©, ©), if it satisfies all seven properties listed above. Both (R, +, • ) and (Q, +, • ) 
are fields, but (Z, +, ■ ) is not, because multiplicative inverse of a does not exist if a ^ ±1. 

Theorem 5.7.4 The algebraic structure (Z n ,©,©) is a field if and only if n is prime. 

Proof: Verification of most of the properties is rather straightforward, with the exception of 
the existence of the multiplicative inverse, which we shall prove here. Since n is a prime, any 
a £ Z* must be relatively prime to n. Hence, 

as + nt = 1 

for some integers s and t. Modulo n, we find nt = 0, hence, as + nt = 1 becomes 

as = 1. 


Therefore a 1 = s (mod n). 
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The theorem tells us that if n is prime, then Z„ is a field, hence, every nonzero integer has 
a multiplicative inverse. 

Example 5.7.8 Determine 7 _1 (mod 29). 

Solution: We want to find a number a' such that 7a' = 1 (mod 29). Note that gcd(7, 29) = 1. 
Using extended Euclidean algorithm, we find 

7(— 4) + 29-1 = 1. 

Since 29 ■ 1 = 0 (mod 29), after reducing modulo 29, we find 

7(— 4) = 1 (mod 29). 

This implies that 7 _1 = —4 = 25 (mod 29). A 

When n is composite, Z„ is not a field. Then not every nonzero integer in it has a multi- 
plicative inverse. Of course, some special nonzero integers may still have multiplicative inverses. 

Hands-On Exercise 5.7.9 Determine 8” 1 (mod 45). 


A 


Example 5.7.9 Solve the equation 7x — 3 = 5 over Z 2 g. 

Solution: From 7x — 3 = 5, we find 7x = 8. Recall that what this equation really means is 

7x = 8 (mod 29). 

The answer is not x = because Z 2 g only contains integers as its elements. This is what we 
should do: multiply 7 _1 to both sides of the congruence: 

7~ 1 ■ 7x = 7" 1 • 8 (mod 29). 

Since 7 _1 • 7 = 1 (mod 29), we now have To simulate division, 

we have to multiply 

X = 7 _1 -8 (mod 29). by the multiplicative 

inverse. 

In a way, we use multiplicative inverse to simulate division. In this case, 7 _1 = 7 (mod 29). 

Hence, x = 7 ■ 8 = 26 (mod 29). A 

Hands-On Exercise 5.7.10 Solve the equation 8a; + 23 = 12 over Z45. 


A 


Example 5.7.10 Explain why 3 1 does not exist in Z 2 4 . 
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Solution: Suppose 3 _1 exists in Z 2 4, say, 3 _1 = z (mod 24). This means 3z = 1 (mod 24). 
Hence, 

3 z = 24 q + 1 

for some integer q. This in turn implies that 

1 = 3z — 24 q = 3 (z — 8q), 

which is clearly impossible because 2 — 8q is an integer. This contradiction shows that 3” 1 does 
not exist in Z 2 4. A 

Both R. and Q are infinite fields, while Z„ is a finite field when n is prime. The next result 
is a truly amazing one, because it proclaims that the number of elements in any finite field (one 
with finitely many elements) must be the power of a certain prime. Unfortunately, we are unable 
to prove it here, because it is beyond the scope of this course. 

Theorem 5.7.5 There exists a finite field ofn elements if and only if n is the power of a prime. 

Summary and Review 

• Modular arithmetic modulo n uses the mod operation to reduce the answers of all com- 
putation to within 0 through n — 1. 

• Instead of waiting until we obtain the final answer before we reduce it modulo n, it is 
easier to reduce every immediate result modulo n before moving on to the next step in the 
computation. 

• We can use negative integers in the intermediate steps. 

• The set of integers {0, 1, 2, . . . , n — 1}, together with modular arithmetic modulo n, is 
denoted as Z n . 

• For a • a' = 1 (mod n), we say that o' is the multiplicative inverse of a , and denote it a -1 . 

• For some a £ Z„, the multiplicative inverse a -1 may not exist. If it exists, we can use it 
to simulate division. 

Exercises 5.7 

1. Construct the addition and multiplication tables for Zg. Which nonzero elements have 
multiplicative inverses (reciprocals)? What are their multiplicative inverses? 

2. Repeat the last problem with Zg. 

3. Find the sum and product of 1053 and 1761 in Z17. 

4. Some of the results we derived earlier can be easily proven via modular arithmetic. For 
example, show that if an integer n is not divisible by 3, then n = ±1 (mod 3). What can 
you say about n 2 (mod 3)? Therefore what form must n 2 take? 

5. Show that no integer of the form m 2 + 1 is a multiple of 7. 

Hint : What are the possible values of m (mod 7)? Compare this to the last problem. 

6. What are the possible values of m (mod 13) such that m 2 + 1 is a multiple of 13? 

Hint: Compute m 2 + 1 (mod 13) for each value of m. 

7. Find the value of 4 45 in Z n 

(a) using the fact that 45 = 3 • 3 • 5 

(b) using repeated squaring 

8. Use repeated squaring to evaluate 5 23 (mod 11). 
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9. Solve these equations 

(a) 2x + 5 = 10 over Zi 3 

(b) 37x + 28 = 25 over Z 57 

(c) 12 — 2Ax = 15 over Z 35 

10. Let p and q be odd primes. 

(a) Show that p takes the form of either 6k + 1 or 6k + 5. 

Hint: First, explain why being odd restricts p to the form of 6k + 1, 6k + 3, and 
6k + 5. Next, argue why p ^ 6k + 3. 

(b) What could p be congruent to, modulo 24? 

(c) Show that if p > q > 5, then 24 | (p 2 — q 2 ). 

Hint: What are the possible values of p 2 and q 2 modulo 24? 

11. Use modular arithmetic to prove that, if is an integer not divisible by 5, then n 4 — 1 is 
divisible by 5. 

12. Use modular arithmetic to prove that 8 | (5 2n + 7) for any integer n > 0. 

13. Use modular arithmetic to prove that 3 | (2 2n — 1) for any integer n > 0. 

14. Use modular arithmetic to prove that 5 | (3 3n+1 + 2 n+1 ) for any integer n > 0. 
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Chapter 6 


Functions 


6.1 Functions: An Introduction 

The functions we studied in calculus are real functions, which are defined over a set of real 
numbers, and the results they produce are also real. In this chapter, we shall study their 
generalization over other sets. The definition could be difficult to grasp at the beginning, so we 
would start with a brief introduction. 

Most students view real functions as computational devices. However, in the generalization, 
functions are not restricted to computation only. A better way to look at functions is their 
input-output relationship. Let / denote a function. Given an element (which need not be a 
number), we call the result from / the image of x under f, and write f(x), which is read as 
“/ of®.” 

Imagine / as a machine. It takes the input value x, and returns f(x) as the output value. 
This input-output relationship is depicted in Figure 6.1 in two different ways. 

x 


I 



Figure 6.1: Two pictorial views of a function as a machine. 

The question is: how could we obtain f(x)l A function need not involve any computation. 
Consequently, we cannot speak of “computing” the value of /( x). Instead, we talk about what 
is the rule we follow to obtain f(x). This rule can be described in many forms. We can, of 
course, use a computational rule. But a table, an algorithm, or even a verbal description also 
work as well. 

When we say a real function is defined over the real numbers, we mean the input values must 
be real numbers. The output values are also real numbers. In general, the input and output 
values need not be of the same type. The nearest integer function, denoted [x], rounds 
the real number x to the nearest integer. Here, the images (the output values) are integers. 
Consequently, we need to distinguish the set of input values from the set of possible output 
values. We call them the domain and the codomain, respectively, of the function. 

Example 6.1.1 When a professor reports the final letter grades for the students in her class, 
we can regard this as a function g. The domain is the set of students in her class, and the 
codomain could be the set of letter grades {A, B,C,D,F}. A 
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We said the codomain is the set of possible output values, because not every element in the 
codomain needs to appear as the image of some element from the domain. If no student fails 
the professor’s class in Example 6.1.1, no one will receive the final grade F. The collection of the 
images (the final letter grades) form a subset of the codomain. We call this subset the range 
of the function g. The range of a function can be a proper subset of the codomain. Hence, the 
codomain of a function is different from the set of its images. If the range of a function does 
equal to the codomain, we say that the function is onto. 

Example 6.1.2 For the nearest integer function h(x) = [x], the domain is R. The codomain 
is Z, and the range is also Z. Hence, the nearest integer function is onto. A 

Example 6.1.3 Let x be a real number. The greatest integer function [xj returns the 
greatest integer less than or equal to x. For example, 

[750] = 7, L— 6 - 34 J = ~7, and |_ 15 J = 15- 

Therefore, [xj returns x if it is an integer, otherwise, it rounds x down to the next closest 

integer. Hence, it is also called the floor function of x. It is clear that its domain is R, and 

the codomain and range are both Z. A 

Hands-On Exercise 6.1.1 Let x be a real number. The least integer function \x] returns 
the least integer greater than or equal to x. For example, 

[750] =8, [—6.34] = —6, and [15] = 15. 

Thus, [x] returns x if it is an integer, otherwise, it rounds x up to the next closest integer. 
Hence, it is also called the ceiling function of x. What is its domain and codomain? 


A 


We impose two restrictions on the input-output relationships that we call functions. For 
any fixed input value x, the output from a function must be the same every time we use the 
function. As a machine, it spits out the same answer every time we feed the same value x to it. 
As a calculator, it displays the same answer on its screen every time we enter the same value x, 
and push the button for the function. We call the output value the image of x, and write /(x). 
The first important requirement for a function / to be well-defined is: the image /(x) is unique 
for any fixed x- value. 

A good machine must perform properly. In terms of a function /, we must be able to obtain 
/(x) for any value x (and, of course, produce only one result for each x). This is perhaps a 
little bit too demanding. A remedy is to restrict our attention to those x’s over which / would 
work. The set of legitimate input values is precisely what we call the domain of the function. 
Consequently, the second requirement says: for every element x from the domain, the output 
value /(x) should be well-defined. This is the mathematical way of saying that the value /(x) 
can be obtained. 


Example 6.1.4 Compare this to a calculator. If you enter a negative number and press the 
7” button, an error message will appear. To be able to compute the square root of a number, 
the number must be nonnegative. The domain of a function is the set of acceptable input values 
for which meaningful results can be found. For the square root function, the domain is R + U{0}, 
which is the set of nonnegative real numbers. A 
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Hands-On Exercise 6.1.2 For the square root function, we may regard its codomain as R. 
What is its range? Is the function onto? 


A 

Hands-On Exercise 6.1.3 For the square root function, can we say its domain is M + U 0? 
Explain. 


The two conditions for a function to be well-defined are often combined and written as if it 
were only one condition: 

A function f is well-defined if every element x from the domain has a unique image 
in the codomain. 

When you examine this definition closer, you will find the two separate requirements: 

• every element in the domain has an image under /, and 

• the image is unique. 

In the next section, we shall present the complete formal definition. 

Summary and Review 

• A function is a rule that assigns to every element in the domain a unique image in the 
codomain. 

Exercises 6.1 

1. Complete the following table: 


X 

5.7 

7 r 

e 

-7.2 

-0.8 

9 

W 

w 

[*] 








2. What is the domain and the codomain of the cube root function? Is it onto? 

3. For the square root function, how would you use the interval notation to describe the 
domain? 

4. For the square root function, which set complement would you use to describe the domain? 

6.2 Definition of Functions 


Definition. Let A and B be nonempty sets. A function from A to B is a rule that assigns 
to every element of A a unique element in B. We call A the domain , and B the codomain , of 
the function. If the function is called /, we write f: A — ► B. Given x £ A, its associated element 
in B is called its image under /. We denote it f(x), which is pronounced as “/ of x. n <0> 


Every element in the 
domain has a unique 
image. 
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Every element in the 
domain has a unique 
image. 


A function is sometimes called a map or mapping. Hence, we sometimes say / maps x to 
its image f(x). Functions are also called transformations. 

Example 6.2.1 The function /: {a, b, c} -» {1, 3, 5, 9} is defined according to the rule 
/(a) = 1, f(b) = 5, and /(c) = 9. 

It is a well-defined function. The rule of assignment can be summarized in a table: 


X 

a 

b 

c 

/O) 

i 

5 

9 


We can also describe the assignment rule pictorially with an arrow diagram , as shown in 
Figure 6.2. A 



Figure 6.2: An example of a well-defined function. 


The two key requirements of a function are 

• every element in the domain has an image under /, and 

• the image is unique. 

You may want to remember that every element in A has exactly one “partner” in B. 

Example 6.2.2 Figure 6.3 depicts two examples of non- functions. In the one on the left, one 
of the elements in the domain has no image associated with it. In the one on the right, one of 
the elements in the domain has two images assigned to it. Both are not functions. A 



Figure 6.3: Two types of non-functions. 


Hands-On Exercise 6.2.1 Do these rules 


X 

a 

b 

c 

f{x) 

5 

3 

3 


X 

b 

c 

9(x) 

9 

5 


X 

a 

b 

b 

c 

h{ x) 

i 

5 

3 

9 
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produce well-defined functions from {a, b, c} to {1,3, 5, 9}? Explain. 


Hands-On Exercise 6.2.2 Does the definition 

, n _ f x if today is Monday, 

' {2a; if today is not Monday 

produce a well-defined function from R to R? Explain. 


A 


Hands-On Exercise 6.2.3 Does the definition 


if x < 2, 
if x > 3, 


s(x) = 

produce a well-defined function from R to R? Explain. 


A 


A 


Example 6.2.3 The function /: [0, oo) — » R defined by 

fix) = Vx 

is well-defined. So is the function g: [2, oo) — > R defined as 

g( x) = \/x — 2. 

Can you explain why the domain is [2, oo)? ▲ 

Example 6.2.4 Let A denote the set of students taking Discrete Mathematics, and G = 
{A, B, C, D,F}, and £(x) is the final grade of student x in Discrete Mathematics. Every student 
should receive a final grade, and the instructor has to report one and only one final grade for 
each student. This is precisely what we call a function. ▲ 

Example 6.2.5 The function n: p({a,b,c,d}) — > Z is defined as n(S) = |S|. It evaluates the 
cardinality of a subset of {a, 6, c, <?}. For example, 


▲ 


Note that n(0) = 0. 


n({a,c}) = n({M}) = 2. 
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Exercise caution when 
the domain and 
codomain involve 
different moduli. 


Hands-On Exercise 6.2.4 Consider Example 6.2.5. What other subsets S of {a,b,c,d} also 
yield n(S) = 2? What are the smallest and the largest images the function n can produce? 


A 


Example 6.2.6 Consider a function /: Z 7 — > Z5. The domain and the codomain are, 

Z 7 = {0,1,2,3,4,5,6}, and Z 5 = {0, 1, 2, 3, 4}, 

respectively. Not only are their elements different, their binary operations are different too. In 
the domain Z 7 , the arithmetic is performed modulo 7, but the arithmetic in the codomain Z 5 is 
done modulo 5. So we need to be careful in describing the rule of assignment if a computation 
is involved. We could say, for example, 

f(x) = z, where z = 3x (mod 5). 

Consequently, starting with any element x in Z 7 , we consider x as an ordinary integer, multiply 
by 3, and reduce the answer modulo 5 to obtain the image f(x). For brevity, we shall write 

f(x) = 3x (mod 5). 


We summarize the images in the following table: 


n 

0 

1 

2 

3 

4 

5 

6 

f(n) 

0 

3 

1 

4 

2 

0 

3 


Take note that the images start repeating after /( 4) = 2. 

Hands-On Exercise 6.2.5 Tabulate the images of g: Z10 — » Z5 defined by 

g(x) = 3x (mod 5). 


A 


A 

Definition. The graph of a function f:A—>B is the set of ordered pairs (x,y) from Ax B 
such that y = f{x). <0> 

The graph of a function, in this general definition, may not look like the kind of graphs we 
expected from real functions. A graph is, by definition, a set of ordered pairs. 

Example 6.2.7 The graph of the function / in Example 6.2.6 is the set of ordered pairs 

{(0, 0), (1, 3), (2, 1), (3, 4), (4, 2), (5, 0), (6, 3)}. 

If one insists, we could display the graph of a function using an icy-plane that resembles the 
usual Cartesian plane. Keep in mind: the elements x and y come from A and B , respectively. 
We can “plot” the graph for / in Example 6.2.6 as shown below. 
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4 

3 

2 

1 


H 1 1 1 1 1 1 *- x 

0 1 2 3 4 5 6 


Besides using a graphical representation, we can also use a (0, l)-matrix. A (0, l)-matrix is 
a matrix whose entries are 0 and 1. For the function /, we use a 7 x 5 matrix, whose rows and 
columns correspond to the elements of A and B , respectively, and put one in the (i,j) th entry 
if j = f(i), and zero otherwise. The resulting matrix is 


0 

1 

2 

3 

4 

5 

6 


0 12 3 4 

/ 1 0 0 0 0 \ 

0 0 0 1 0 

0 10 0 0 
0 0 0 0 1 

0 0 10 0 

1 0 0 0 0 

\ 0 0 0 1 0 


We call it the incidence matrix for the function /. 


▲ 


Hands-On Exercise 6.2.6 “Plot” the graph of g in Hands-On Exercise 6.2.5. Also construct 
its incidence matrix. 


A 


Summary and Review 

• A function / from a set A to a set B (called the domain and the codomain, respectively) 
is a rule that describes how a value in the codomain B is assigned to an element from the 
domain A. 

• But it is not just any rule; rather, the rule must assign to every element x in the domain 
a unique value in the codomain. 

• This unique value is called the image of x under the function /, and is denoted f(x). 

• We use the notation /: A — y B to indicate that the name of the function is /, the domain 
is A , and the codomain is B. 

• The graph of a function /: A — >■ B is the collection of all ordered pairs (x, y) from Ax B 
such that y = f(x). 

• The graph of a function may not be a curve, as in the case of a real function. It can be 
just a collection of points. 

• We can also display the images of a function in a table, or represent the function with an 
incidence matrix. 
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Exercises 6.2 

1. What subset A of R would you use to make f:A — >• R defined by f(x ) = V 3a; — 7 a 
well-defined function? 


2. What subset 2 I of R would you use to make 

(a) g: A — >• R, where g(x) = \J [x — 3)(x — 7) 

(b) h: A — >• R, where /i(x) = X ^ 

v( x ~ 2 )(5-a;) 

well-defined functions? 

3. Which of these data support a well-defined function from {1, 2, 3, 4} to {1, 2, 3, 4}? Explain. 


X 

1 

2 

3 

/O) 

3 

4 

2 


X 

l 

2 

3 

4 

g{x) 

2 

4 

3 

2 


X 

1 

2 

3 

3 

4 

h{ x) 

2 

4 

3 

2 

3 


4. Which of the following are the graphical representation or incidence matrix of well-defined 
functions from {1,2, 3,4} to {1,2, 3, 4}? Explain. 


y 

4 -- 


/■ 


3 

2 


1 + 



9- 


1 


1 

2 

3 

4 


1 

Vo 


2 3 

1 0 
0 0 
0 0 
0 1 


4 

0 

0 

0 

0 


\ 


5. Determine whether these are well-defined functions. Explain. 

(a) f:R ->• R, where f(x) = ^ 

x H - o 

7 

(b) g: (5, oo) -A R, where g(x) = . 

V x — 4 

(c) fcR->R, where h(x) = —\/7 — Ax + 4a; 2 . 

6. Determine whether these are well-defined functions. Explain. 

(a) s: R — >■ R, where x 2 + [s(a;)] 2 = 9. 

(b) t: R — >■ R, where \x — t(x) \ = 4. 

7. Below are the graph of the function p and the incidence matrix for the function g, respec- 
tively, from {1,2, 3,4} to {1,2, 3, 4}. 

y 

12 3 4 

/ 0 1 0 0 \ 

0 0 10 

10 0 0 

\ 0 0 1 0 / 


H 1 1 h 

12 3 4 



X 
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Complete the following table: 


8. Let T be your family tree that includes your biological mother, your maternal grand- 
mother, your maternal great-grandmother, and so on, and all of their female descendants. 
Determine which of the following define a function from T to T. 

(a) hi'.T — > T, where hi(x) is the mother of x. 

(b) h 2 :T — > T, where h 2 (x) is x’s sister. 

(c) h$\T — > T, where /i 3 (x) is an aunt of x. 

(d) hi'.T — > T, where h^x) is the eldest daughter of x’s maternal grandmother. 

9. For each of the following functions, determine the image of the given x. 

(a) k\: N — {1} — > N, k\{x) = smallest prime factor of x, x = 217. 

(b) fc 2 :Zn — > Zn, k 2 (x) = 3x (mod 11), x = 6. 

(c) /c 3 :Z 15 — » Z 15 , fc 3 (x) = 3x (mod 15), x = 6. 

10. For each of the following functions, determine the images of the given x-values. 

(a) £x'.Z —> Z, £i(x) = x mod 7, x = 250, x = 0, and x = —16. 

Remark: Recall that, without parentheses, the notation “mod” means the binary 
operation mod. 

(b) i 2 : Z — > Z, liix) = gcd(x, 24), x = 100, x = 0, and x = —21. 

6.3 One-to-One Functions 

We distinguish two special families of functions: the one-to-one functions and the onto functions. 
We shall discuss one-to-one functions in this section, and onto functions in the next. 

Definition. A function /: A — > B is said to be one-to-one if 

xi ± x 2 => f(x i) ^ f(x 2 ) 

for all elements Xi,x 2 € A. A one-to-one function is also called an injection , and we call 
a function injective if it is one-to-one. A function that is not one-to-one is referred to as 

many-to-one. <£> 

Any well-defined function is either one-to-one or many-to-one. A function cannot be one- 
to-many because no element can have multiple images. The difference between one-to-one and 
many-to-one functions is whether there exist distinct elements that share the same image. There 
are no repeated images in a one-to-one function. 

Example 6.3.1 The identity function on any nonempty set A 

iA'- A —> A, ia(x) = x, 

maps any element back to itself, ft is clear that all identity functions are one-to-one. A 

Example 6.3.2 The function h: A — >■ A defined by h(x) = c for some fixed element c € A, is 
an example of a constant function. It is a function with only one image. This is the exact 
opposite of an identity function. It is clearly not one-to-one unless |A| = 1. A 


X 

1 

2 

3 

4 

q{x) 






X 

1 

2 

3 

4 

p(x) 






To be one-to-one, 
distinct elements 
must have distinct 
images. 
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Use this definition to 
prove that a function 
is one-to-one. 


For domains with a small number of elements, one can use inspection on the images to 
determine if the function is one-to-one. This becomes impossible if the domain contains a larger 
number of elements. 

In practice, it is easier to use the contrapositive of the definition to test whether a function 
is one-to-one: 

f(x i) = /( x 2 ) =>X!= x 2 . 


Example 6.3.3 Is the function /: M — > R defined by /( x) = 3x + 2 one-to-one? 

Solution: Assume f(x i) = /(x 2 ), which means 

3xi T 2 = 8x2 T 2. 

Thus 3xi = 3x2, which implies that Xi = x 2 . Therefore / is one-to-one. A 

Hands-On Exercise 6.3.1 Determine whether the function <7: M — K. defined by g(x) = 5— 7x 
is one-to-one. 


A 


Hands-On Exercise 6.3.2 Determine whether the function h: [2, 00) — > R defined by h(x) = 
yjx — 2 is one-to-one. 


A 

Interestingly, sometimes we can use calculus to determine if a real function is one-to-one. A 
real function / is increasing if 

Xi < x 2 => /(xi) < /(x 2 ), 

and decreasing if 

Xi < x 2 => /(x 1) > /(x 2 ). 

Obviously, both increasing and decreasing functions are one-to-one. From calculus, we know 
that 

• A function is increasing over an open interval (a, b) if f'(x ) > 0 for all x € (a, b). 

• A function is decreasing over an open interval (a, b) if f'(x) < 0 for all x € (a, b). 

Therefore, if the derivative of a function is always positive, or always negative, then the function 
must be one-to-one. 

Example 6.3.4 The function p: R — > R. defined by 

p(x) = 2x 3 — 5 

is one-to-one, because p'{x) = 6x 2 > 0 for any x € R*. Likewise, the function g:( — |,|)— 
defined by 

q(x ) = tanx 

is also one-to-one, because q'(x) = sec 2 x > 0 for any x £ ( — f , f ) ■ * 
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Hands-On Exercise 6.3.3 Use both methods to show that the function k: (0, oo) — > R defined 
by k(x) = In x is one-to-one. 


A 


Example 6.3.5 The function fi:R — »• R given by h( x) = x 2 is not one-to-one because some of 
its images are identical. For example, h{ 3) = h(— 3) = 9. It is a many-to-one function. Likewise, 
the absolute value function \x\ is not one-to-one. 

The functions p: [0, oo) — > R defined by p( x) = x 2 and q: [0, oo) —> R defined by q(x) = \x\ 
are one-to-one. Whether a function is one-to-one depends not only on its formula, but also on 
its domain. Consequently, sometimes we may be able to convert a many-to-one function into a 
one-to-one function by modifying its domain. ▲ 


You can use 
counterexamples to 
show that a function 
is not one-to-one. 


Example 6.3.6 Construct a one-to-one function from [1,3] to [2,5]. 


Solution: There are many possible solutions. In any event, start with a graph. We can use a 
straight line graph. The domain [1,3] lies on the rr-axis, and the codomain [2,5] lies on the 
y- axis. Hence the graph should cover the boxed region in Figure 6.4. 



Figure 6.4: Three candidates for one-to-one functions from [1,3] to [2,5]. 


Every number in 
the domain must 
have an image. 


V- 2 4-2 

x - 1 3-1 

The last step is to write the answer in the form of f(x ) = . . . . We have to express y in terms of 
x. We find y = x + 1. Hence, 


All three graphs do not produce duplicate images. We need to cover all x-values from 1 to 3 
in order for the function to be well-defined. This leaves only the first two graphs as legitimate 
examples. 

To determine the formula for /, we need to derive the equation of the line. Take the first 
graph as our choice. The line joins the point (1, 2) to the point (3, 4). Thus, its equation is 


/: [1)3] — ► [2,5], /(*)=* + 1 


▲ 


is an example of a one-to-one function. 
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Hands-On Exercise 6.3.4 Construct a one-to-one function from [1,3] to [2,5] based on the 
second graph in Example 6.3.6. 


A 

Hands-On Exercise 6.3.5 Construct a one-to-one function from [3,8] to [2,5]. 


A 


Example 6.3.7 Determine whether the function 3:^43 -A Z 43 defined by 

g( x) = 11 x — 5 (mod 43) 


is one-to-one. 

Solution: Assume g(x 1 ) = g(x 2 ). This means 

liar — 5 = lla :2 — 5 (mod 43), 


which implies 


llXi = 11X2 

Notice that 4 • 11 = 44 = 1 (mod 43), hence ll ” 1 
the last congruence yields 

44a: 1 = 44x 2 

which is equivalent to, since 44 = 1 (mod 43), 


(mod 43). 

= 4 (mod 43). Multiplying 4 to both sides of 
(mod 43), 


Xi = x 2 (mod 43). 

Therefore, Xi = x 2 in Z 43 . This proves that g is one-to-one. ▲ 

Hands-On Exercise 6.3.6 Is the function h: Z 15 — »• Z 15 defined by 

h(x) = 4a; — 11 (mod 15) 


a one-to-one function? 


A 
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Hands-On Exercise 6.3.7 Show that the function k: Z 15 -7 Z 15 defined by 

k(x) = 5# — 11 (mod 15) 
is not one-to-one by finding Xi 7 ^ X 2 such that k(x 1 ) = k(x 2 ). 


You can use a 
counterexample to 
show that a function 
is not one-to-one. 


A 

Example 6.3.8 In the last hands-on exercise, we should not rely on the non-existence of 5 _1 
in Z 15 to prove that k is not one-to-one. One must consider the interaction between the domain, 
the codomain, and the definition of the function. For example, despite the fact that 5 _1 does 
not exist in Z 15 , the function p: Z 3 —7 Z 15 defined by 

p(x) = 5x — 11 (mod 15) 

is one-to-one, because p{ 0) = 4, p(l) = 9, and p( 2) = 14 are distinct images. A 


The last example illustrates the trickiness in a function with different moduli in its domain 
and codomain. Use caution when you deal with such functions! Sometimes, infinite sets also 
pose a challenge. Because there is an infinite supply of elements, we may obtain results that 
appear to be impossible for finite sets. 


Example 6.3.9 The function /:Z — ► Z defined by 


Sin) 


§ if n is even 
if n is odd 


is not one-to-one, because, for example, /( 0) = /(— 1) = 0. The function </:Z — >• Z defined by 


g(n) = 2 n 


is one-to-one, because if < 7 ( 711 ) = < 7 ( 7 X 2 ), then 2n\ = 2 n 2 implies that n\ = n 2 . 


A 


Hands-On Exercise 6.3.8 Show that the function h: Z — > N defined by 


h(n) 


2n + 1 if n > 0, 
—2 n if n < 0, 


is one-to-one. 


A 

Example 6.3.10 Let A be the set of all married individuals from a monogamous community 
who are neither divorced nor widowed. Then the function s: A — > A defined by 

s(x) = spouse of x 

is one-to-one. The reason is, it is impossible to have X\ 7 ^ X2 and yet s(xi) = s(x2)- A 
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Summary and Review 

• A function / is said to be one-to-one if f(x 1 ) = /( x 2 ) => x\ = rr 2 . 

• No two images of a one-to-one function are the same. 

• To show that a function / is not one-to-one, all we need is to find two different x-values 
that produce the same image; that is, find x\ 7 ^ X 2 such that f(x 1 ) = f(x 2 ). 


Exercises 6.3 

1. Which of the following functions are one-to-one? Explain. 

(a) /: K. — > R, f(x) = x 3 — 2x 2 + 1. 

(b) g: [ 2, 00 ) — > R, f(x) = x 3 — 2x 2 + 1. 

2. Which of the following functions are one-to-one? Explain. 

(a) p:R->M, h{x) = e 1 ” 2 *. 

(b) q:M. — > R, p(x) = |1 — 3x|. 

3. Construct a one-to-one function /: (1,3) — > (2,5) so that /: [1,3) -A [2,5) is still one-to- 
one. 

4. Construct a one-to-one function g: [2,5) — > (1,4]. 

5. Determine which of the following are one-to-one functions. 


(a) /: Z — > Z; 

f(n) = n 3 + 1 

(b) 3:Q^Q; 

g{x) = n 2 

(c) h: R — > R; 

h(x) = x 3 — x 

(d) k: K — > K; 

k( x) = 5 X 


6. Determine which of the following are one-to-one functions. 

(a) p: p({ 1, 2, 3, ... , n}) ->{0,1,2,..., n}; p(S) = |5| _ 

(b) q: p({ 1, 2, 3, . . . , n}) -> p({l, 2, 3, ... , n}); q(S) = S 

7. Determine which of the following functions are one-to-one. 

(a) / 1 : {1,2, 3, 4, 5} -> {a,6,c,d}; /i(l) = 6, /i(2) = c, /i(3) = a, /i(4) = a, /i(5) = c 

(b) Jg: {1, 2, 3, 4} — > {a, b, c, d, e}; / 2 (1) = c, / 2 (2) = b, / 2 (3) = a, / 2 (4) = d 

(c) / 3 :Z -> Z; / 5 (n) = -n 

(d) > Z; A(») = {_^ 


8 . Determine which of the following functions are one-to-one. 


(a) g\\ {1, 2 , 3, 4, 5} 

(b) 52 : { 1 , 2 , 3, 4, 5} 

(c) 33 : N -> N; 

(d) (/ 4 : N — > N ; 


93 (n) = 

34 (n) = | 


{a, b, c, d, e}; ffi(l) = 6 , gi( 2 ) 
{a, b, c, d, e}; g 2 (l) = d, g 2 (2) 
( n + l )/2 if n is odd 
n /2 if n is even 

n + 1 if n is odd 
n — 1 if n is even 


6 , 31 (3) = 6 , 3 i( 4 ) = a, gi(5) = d 
b, 32 ( 3 ) = e, 32 ( 4 ) = a, 32 ( 5 ) = c 


9. List all the one-to-one functions from {1,2} to {a, b, c, d}. 
Hint : List the images of each function. 


10. Is it possible to find a one-to-one function from {1,2, 3, 4} to {1,2}? Explain. 

11. Determine which of the following functions are one-to-one. 
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(a) /:Z 10 — > Zio; /i(n) = 3n (mod 10). 

(b) g: Zio -» Zi 0 ; p(n) = 5n (mod 10). 

(c) h: Z 36 — » Z 36 ; /i(n) = 3?r (mod 36). 

12. Determine which of the following functions are one-to-one. 

(a) r:Z 36 — ► Z 36 ; r(n) = 5?i (mod 36). 

(b) s:Z 10 — »Z 10 ; s(n) = n + 5 (mod 10). 

(c) t: Zio — > Zio; t(n) = 3n + 5 (mod 10). 

13. Determine which of the following functions are one-to-one. 

(a) a:Z 12 — > Z7; a(n) = 2 n (mod 7). 

(b) /3: Z 8 — » Z 12 ; /3(n) = 3n (mod 12). 

(c) 7:Z 6 — > Z i2 ; 7(71) = 2n (mod 12). 

(d) 6 : Z12 — > Z 36 ; S(n) = 611 (mod 36). 

14. Give an example of a one-to-one function / from N to N that is not the identity function. 


6.4 Onto Functions 

One-to-one functions focus on the elements in the domain. We do not want any two of them 
sharing a common image. Onto functions focus on the codomain. We want to know if it contains 
elements not associated with any element in the domain. 


Definition. A function /: A — > B is onto if, for every element b £ B, there exists an element 
a £ A such that 


/(a) = b. 


An onto function is also called a surjection , and we say it is surjective. 


0 


Example 6.4.1 The graph of the piecewise-defined functions h: [1,3] — >• [2,5] defined by 


f 3x — 1 if 1 < x < 2, 
( — 3x + 11 if 2 < x < 3, 


is displayed on the left in Figure 6.5. It is clearly onto, because, given any y £ [2, 5], we can find 
at least one x £ [1, 3] such that h{x) = y. Likewise, the function k: [1, 3] -+ [2, 5] defined by 


( 3x — 1 if 1 < x < 2, 
\ 5 if 2 < x < 3, 


is also onto. Its graph is displayed on the right of Figure 6.5. A 

Hands-On Exercise 6.4.1 The two functions in Example 6.4.1 are onto but not one-to-one. 
Construct a one-to-one and onto function / from [1,3] to [2,5]. 


A 
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Figure 6.5: Two onto functions from [1,3] to [2,5]. 


Hands-On Exercise 6.4.2 Construct a function <7: [1, 3] — » [2,5] that is one-to-one but not 
onto. 


A 

Hands-On Exercise 6.4.3 Find a subset 5 of R that would make the function s:R — ► B 
defined by s(x) = x 2 an onto function. 


A 


Example 6.4.2 Construct a function g: (5, 8) — > K. that is both one-to-one and onto. 

Remark. This is a challenging problem. Since the domain is an open interval, a straight line 
graph does not work, because it will not cover every number in the codomain. <0> 

Solution: The solution is based on the observation that the function h: (— f , f ) — > R defined 
by h{x) = tan 2 is one-to-one and onto. For this to work in this problem, we need to shift and 
scale the interval (5, 8) to the same size as (— f , §). 

First, we have to shift the center of the interval (5,8) to the center of the interval (— f , f )■ 
The midpoint of the interval (5, 8) is AA = and the midpoint of (—§,§) is 0. Hence, we need 
to shift the interval (5, 8) to the left units. This means we need to use the transformation 
x — The two endpoints 5 and 8 become — § and | , respectively: 


X 

5 

13 

2 

8 

- 13 

^ 2 

3 

2 

0 

3 

2 


After the transformation x — -y , the original interval (5, 8) becomes the interval (— |, |). Next, 
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we want to stretch the interval (—§,§) into (—§,§)• This calls for a scaling factor of J. 


X 

5 

13 

2 

8 

f (*-¥) 

7T 

2 

0 

7T 

2 


Putting these transformations together, we conclude that 

g(x) = tan 

gives a one-to-one and onto function from (5, 8) to 


7 t ( 13 

3 ( y 


Hands-On Exercise 6.4.4 Construct a function h:( 2,9) 
onto. 


that is both one-to-one and 


A 

In general, how can we tell if a function f:A—>B is onto? The key question is: given an 
element y in the codomain, is it the image of some element x in the domain? If it is, we must 
be able to find an element x in the domain such that f(x) = y. Mathematically, if the rule of 
assignment is in the form of a computation, then we need to solve the equation y = f(x) for 
x. If we can always express x in terms of y, and if the resulting auvalue is in the domain, the 
function is onto. 

Example 6.4.3 Is the function p: R — > R defined by p(x) = 3a: 2 — Ax + 5 onto? 

Solution 1: Let y = 3a: 2 — 4a; + 5, we want to know if we can always express x in terms of y. 
Rearranging the equation, we find 

3a; 2 — 4a; + (5 — y) = 0. 

We want this equation to be solvable over R, that is, we want its solutions to be real. This 
requires its discriminant to be nonnegative. So we need 

(— 4) 2 - 4 • 3 • (5 - y) = 12y - 44 > 0. 

We have real solutions only when y > . This means, when y < ir. we cannot find an a;- value 

such that p(x) = y. Therefore, p is not onto. A 

Solution 2: By completing the square, we find 

p(x) = 3x 2 — 4x + 5 = 3 ^x 

Since p(x) -ft 4^, it is clear that p is not onto. 

Hands-On Exercise 6.4.5 The function g: R — > R is defined as g{x) = 3x + 11. Prove that it 
is onto. 


2 \ 11 11 

- H > — . 

3 3 ~ 3 


A 
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Example 6.4.4 Is the function p:M. — > K. defined by 


( 4a: + 1 if x < 3 
y \ x if x > 3 


an onto function? 

Solution: The graphs y = 4x + 1 and y = | x are both increasing. For x < 3, the y-values 
cover the range (— oo, 13), and for x > 3, the y-values cover the range (|,oo). Since these two 
y-ranges overlap, all the y-values are being covered by the images. Therefore, p is onto. A 

Hands-On Exercise 6.4.6 Determine whether 

r , x f 3x + 1 if x < 2 

/w = i4* a*; 2 


is an onto function. 


A 


Example 6.4.5 Consider the function -A Z 43 defined by 

g(x) = 11a: — 5 (mod 43). 


Let 

then 

This shows that g is onto. 


y = g(x) = 11a: — 5 (mod 43), 
x = ll~ 1 (y + 5) = 4{y + 5) (mod 43). 


A 


Hands-On Exercise 6.4.7 Show that the function h:h 23 —> ^23 defined by h(x) = 5x + 8 
(mod 23) is onto. 


A 


Example 6.4.6 Is the function u:Z — > Z defined by 


u(n) 


(2 n if n > 0 
l — n if n < 0 


one-to-one? Is it onto? 


Solution: Since u(— 2) = w(l) = 2, the function u is not one-to-one. Since u(n) > 0 for any 
n £ Z, the function u is not onto. A 
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Hands-On Exercise 6.4.8 Is the function icN — > N defined by v(n) = n + 1 onto? Explain. 


A 

Example 6.4.7 The function s in Example 6.3.10 is both one-to-one and onto. It provides 
a one-to-one correspondence between the elements of A by matching a married individual to 
his/her spouse. ▲ 

Hands-On Exercise 6.4.9 Is the function hi in Exercises 6.2, Problem 8, an onto function? 
Explain. 


A 


Summary and Review 

• A function f:A—>B is onto if, for every element b £ B, there exists an element a £ A 
such that /(a) = b. 

• To show that / is an onto function, set y = f(x), and solve for x, or show that we can 
always express x in terms of y for any y £ B. 

• To show that a function is not onto, all we need is to find an element y £ B, and show 
that no rr-value from A would satisfy f(x) = y. 

Exercises 6.4 

1. Which of the following functions are onto? Explain! 

(a) /: K. — > R, f(x) = x 3 — 2x 2 + 1. 

(b) g: [2, oo) — * R, g(x) = x 3 — 2x 2 + 1. 

2. Which of the following functions are onto? Explain! 

(a) p: R. — > R, p(x) = e 1 ~ 2x . 

(b) q: R — > R, q(x) = |1 — 3x|. 

3. Construct a one-to-one function /: [1,3] — > [2,5] that is not onto. 

4. Construct an onto function g: [2,5) — > (1,4] that is not one-to-one. 

5. Determine which of the following are onto functions. 


(a) /: Z — ► Z; 

f(n) = n 3 + 1 

(b) g: Q — ¥ Q; 

g(x) = n 2 

(c) h: R — > K; 

h{x) = x 3 — x 

(d) k: K — > K; 

k(x) = 5 X 


6. Determine which of the following are onto functions. 

(a) p: p({ 1, 2, 3, ... , n}) — > {0, 1,2,..., n}; p(S) = |5| _ 

(b) q: p({ 1, 2, 3, . . . , n}) -> p({l, 2, 3, . . . , n}); q(S) = S 

7. Determine which of the following functions are onto. 


(a) /i: {1,2, 3, 4, 5} {a,b,c,d}; /i (1) = b , /i(2) = c, /i(3) = a, /i(4) = a, /i(5) = c 

(b) / 2 : {1,2, 3, 4} -> {a,b,c,d,e}; / 2 (1) = c, / 2 (2) = b, / 2 (3) = a, / 2 (4) = d 
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(c) / 3 :Z-) Z; / 5 (n) = -n 


8. Determine which of the following functions are onto. 


(a) si: {1, 2, 3, 4, 

(b) S2 ; {!) 2, 3, 4, 

(c) c/ 3 : N -A N; 


5} 

5} 

93(n) = 


{ a,b,c,d,e }; si(l) = b , si(2) = b , si(3) 
{a, b , c, d, e}; 52(1) = d, S2(2) = 6, #2(3) 
(n + l)/2 if n is odd 
n/2 if n is even 


(d) S4 : N — ► N ; 



if n is odd 
if n is even 


b, Si (4) = a, si (5) = d 
e, S2(4) = a, S2(5) = c 


9. Is it possible for a function from {1, 2} to {a, 6, c, d} to be onto? Explain. 


10. List all the onto functions from {1, 2, 3,4} to {a, fo}? 
Hint : List the images of each function. 


11. Determine which of the following functions are onto. 

(a) /: Z10 — » Z10; h(n) = 3 n (mod 10). 

(b) g:Z 10 Z 10 ; g{n) = 5 n (mod 10). 

(c) h:Z 36 Z 36 ; h(n) = 3 n (mod 36). 

12. Determine which of the following functions are onto. 

(a) r:Z 36 -A Z 36 ; r(n) = 5 n (mod 36). 

(b) s: Z10 — t Z10; s(n) = n + 5 (mod 10). 

(c) t: Z10 — > Z10; t(n) = 3n + 5 (mod 10). 

13. Determine which of the following functions are onto. 

(a) a:Z 12 — > Z7; a(n) = 2 n (mod 7). 

(b) /3: Z 8 — > Zi 2 ; /3(n) = 3n (mod 12). 

(c) Z12; 7 (n) = 2n (mod 12). 

(d) S:Z ±2 — > Z 33 ; d(n) = 6n (mod 36). 

14. Give an example of a function /: N — ► N that is 


(a) neither one-to-one nor onto (b) one-to-one but not onto 

(c) onto but not one-to-one (d) both one-to-one and onto 


6.5 Properties of Functions 

In this section, we will study some properties of functions. To facilitate our discussion, we 
need to introduce some notations. Some students may find them confusing and difficult to use. 
Besides memorizing the definitions, try to understand what they really mean. 

Definition. Given a function /: .4 -A B. and CCA, the image of C under f is defined as 

/(C) = {/(*) | xeC}. 

In words, /(C) is the set of all the images of the elements of C. <(> 


Remark. A few remarks about the definition: 
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1. It is about the image of a subset C of the domain of A. Do not confuse it with the image 
of an element x from A. 

2. Therefore, do not merely say “the image.” Be specific: the image of an element, or the 
image of a subset. 

3. Better yet: include the notation /( x) or /(C) in the discussion. 

4. While f(x) is an element in the codomain, /(C) is a subset of the codomain. 

5. Perhaps, the most important thing to remember is: 

If y £ /(C), then y £ B, and there exists an x £ C such that f(x) = y. 


Remember this! 


This key observation is often what we need to start a proof with. <£> 

Definition. Let f: A — > B be a function. The image or range of /, denoted ini /, is defined 
as the set f(A). Hence, im / is the set of all possible images that / can assume. <0> 

The definition implies that a function f:A — > B is onto if im/ = B. Unfortunately, this 
observation is of limited use, because it is not always easy to find im /. 

Example 6.5.1 For the function /:R — t R. defined by 

f(x) = x 2 , 

we find im/ = [0, oo). We also have, for example, /([ 2,oo)) = [4, oo). It is clear that / is 
neither one-to-one nor onto. ▲ 

Example 6.5.2 For the function g:h — > Z defined by 

g (n) = n + 3, 

we find im g = Z, and p(N) = {4, 5, 6 , . . .}. The function g is both one-to-one and onto. ▲ 

Hands-On Exercise 6.5.1 The function p:R. — > R is defined as p(x) = 3a; + 11. Find p(K + ) 
and imp. 


A 


Hands-On Exercise 6.5.2 


The function < 7 : K — >■ K is defined as q( x) = x 2 — x — 7. Find im q. 


A 


Example 6.5.3 The function h: Z 15 — ► Z 15 is defined by 

h(x) = 5x — 11 (mod 15). 


X 

0 

1 

2 

3 

4 

5 

6 


14 

f(x) 

4 

9 

14 

4 

9 

14 

4 


14 


From the tabulated data 
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it becomes clear that the images repeat the pattern 4, 9, 14 five times. Therefore, we determine 
that irn h = {4, 9, 14}. A 


Hands-On Exercise 6.5.3 Determine h({0, 3,4}), where h is defined in Example 6.5.3. 


A 

Example 6.5.4 Determine /({(0, 2), (1, 3)}), where the function /: {0,1,2} x {0,1,2, 3} — ¥ Z 
is defined according to 

/(a, b) = a + b. 

Remark: Strictly speaking, we should write /((a, b )) because the argument is an ordered pair 
of the form (a, 6). However, we often write /(a, 6), because / can be viewed as a two-variable 
function. The first variable comes from {0, 1, 2}, the second comes from {0, 1, 2, 3}, and we add 
them to form the image. 

Solution: Because 

/( 0,2) = 0 + 2 = 2, and /( 1, 3) = 1 + 3 = 4, 

we determine that /({( 0,2), (1,3)}) = {2,4}. A 

Hands-On Exercise 6.5.4 Find im /, where / is defined in Example 6.5.4. 


A 


We are now ready to present the first collection of properties of functions. 

Theorem 6.5.1 Given f:A^B, the following properties hold for any C\ , C 2 C A. 

(a) f(C 1 UC 2 ) = f(C 1 )Uf(C 2 ) 

(b) /(C 1 ! n C 2 ) c f{c x ) n f(C 2 ) 

(c) /(Ci - C 2 ) D f(Ci) - /(C 2 ) 

(d) C\ C C 2 /(Ci) C /(C 2 ) 

Remark. These results provide excellent opportunities to learn how to write mathematical 
proofs. We only provide the proof of (a) below, and leave the proofs of (b)--(d) as exercises. In 
(a), we want to establish the equality of two sets. One way to prove that S = T is to show that 
S C T, and T C S. Now, in order to prove that S C T, we need to show that z € S implies 
z GT; to show that T C S', we want to prove that z € T implies z £ s. <0> 

Proof of (a): First, we want to show that /(Ci U C 2 ) C f(Ci) U /(C 2 ). Let y G f{C\ U C 2 ), 
then there exists x G Ci U C 2 such that f(x) = y. Having x G C\ U C 2 means either x G C\ or 
iGC 2 , so we have to consider two cases. 

• If x G Ci, then /( x) G /(Ci). 

• If x G C 2 , then f (x) G /(C 2 ). 
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Thus, y = }{x) belongs to either f{C\) or /(C 2 ), which means y = f(x) G /(Ci) U /(C 2 ). This 
proves that /(Ci U C 2 ) C /(Cl) U /(C' 2 ). 

Next, we want to show that /(Ci) U /(C' 2 ) C /(Ci U C 2 ). Let y G /(Ci) U /(C 2 ), then y 
belongs to either /(Ci) or /(C 2 ). 

• If y G /(Ci), then there exists Xi G Ci such that /(#i) = y. 

• If y G /(C 2 ), then there exists X 2 G C 2 such that f{x 2 ) = y. 

These two possibilities together imply that there exists an element x belonging to either Ci or 
C 2 , that is, x G Ci U C 2 , such that f(x) = y. This means f(x) G /(Cl U C 2 ). This proves that 

/(Ci) U /(C 2 ) C /(Ci U C 2 ). This concludes the proof of f{C 1 U C 2 ) = /(Ci) U /(C 2 ). ■ 

Hands-On Exercise 6.5.5 Prove part (b) of Theorem 6.5.1. 


A 

Remark. Part (b) of Theorem 6.5.1 only gives a subset relationship. The reason is: having 
y G /(Ci) and y G /(C 2 ) does not necessarily mean that y is the image of the same element. 
Since / can be many-to-one, it is possible to have x± G Ci — C 2 and i 2 G C 2 - Ci such that 
f(x 1 ) = f(x 2 ) = y. Consider /: {1, 2, 3} — > {a, b} defined by 

/(l) = /(3) = «, and /( 2) = b. 

If Ci = {1,2} and C 2 = {2,3}, then /(Ci) = /(C 2 ) = {a, 6}, and 

/(Ci n c 2 ) = / ({ 2 }) = {6} c {«, &} = /(Ci) n /(c 2 ). 

Therefore, we can only conclude that y G /(Ci (~l C 2 ) 4yG /(Ci) (~l /(C 2 ). <0> 

Definition. Given a function and D C B, the preimage of D under f is defined 

as 

f~\D) = {x G A | /(*) G D}. 

Hence, f~ 1 (D) is the set of elements in the domain whose images are in C. The symbol f~ 1 (D) 
is also pronounced as “/ inverse of D." <0> 

Remark. Some remarks about the definition: 

1. The preimage of D is a subset of the domain A. 

2. In particular, the preimage of B is always A. 

3. The key thing to remember is: 

If x G / _1 (D), then iGd, and f(x) G D. 

4. It is possible that / _1 (D) = 0 for some subset D. If this happens, / is not onto. 

5. Therefore, / is onto if and only if / -1 ({&}) 0 for every b G B. <(> 
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Example 6.5.5 If t: K. — » K. is defined by t(x) = x 2 — 5x + 5, find t 1 ({ — 1})- 

Solution: We want to find x such that t(x) = x 2 — 5a; + 5 = — 1. Hence, we have to solve the 
equation 

0 = x 2 — 5x + 6 = (x — 2) (x — 3) . 

The solutions are x = 2 and x = 3. Therefore, t _1 ({— 1}) = {2,3}. A 

Hands-On Exercise 6.5.6 If k:Q — > K. is defined by k(x) = x 2 — x — 7, find fc -1 ({3}). 


A 


Example 6.5.6 For the function /: {0, 1, 2} x {0, 1, 2, 3} — > Z defined by 

f(a,b) = a + b, 

we find 

/ _1 ({3}) = {(0,3), (1,2), (2,1)}, 

/ _1 ({4}) = {(1,3), (2, 2)}. 

Since preimages are sets, we need to write the answers in set notation. A 

Hands-On Exercise 6.5.7 Find ft -1 ({4}) and /i _1 ({ 2}), where the function h is defined in 
Example 6.5.3. 


Theorem 6.5.2 . Given and D\ 1 D 2 C B , the following properties hold. 

(a) U D 2 ) = f-^D,) U f~ 1 (D 2 ) 

(b) f~ 1 {D l n D 2 ) = f~\D 1) n f~ 1 (D 2 ) 

(c) f~\D l - D 2 ) = / _1 (£>i) - f~\D 2 ) 

(d) DiCD^/^C/- 1 ^) 


Proof of (a): First, we want to prove that / 1 (I?i UU 2 ) C / 1 (I?i) U / 1 (D 2 ). Let x € 
f~ 1 {Di U D 2 ), then f(x) € D\ U D 2 . This means either f(x) G D\ or f(x) € D 2 . 

• If f(x) € Di, then x G 

• If f(x) G D 2 , then x G f~ 1 (D 2 ). 

Since x belongs to either f~ 1 (D 1 ) or f~ 1 (D 2 ), we determine that x G f~ l {D 1 ) U f~ 1 {D 2 ). 
Therefore, f~ 1 (D 1 U D 2 ) C / _1 (D 1 ) U f~ 1 { D 2 ). 

Next, we want to prove that f~ 1 (Di)Uf~ 1 (D 2 ) C / _1 (UiUH 2 ). Let x G f~ 1 (Di)Uf~ 1 (D 2 ). 
Then x belongs to either f~ 1 (D 1 ) or x G f~~ 1 (D 2 ). 

• If x G f~ 1 (Di), then f(x) G D\. 

• If x G f~ 1 (D 2 ), then f \x) G D 2 . 
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Hence, f(x) belongs to either D\ or D 2 , which means /( x) € D\ U-D 2 . Thus, x € f~ 1 {D\ L)D 2 ). 
We have proved that U f~ 1 {D 2 ) C / _1 (U 1 U D 2 ). Together with U D 2 ) C 

/ _1 (Di) U f~ 1 (D 2 ), we conclude that f~ 1 (D 1 U D 2 ) = / _1 (£> i) U f~ 1 (D 2 ). ■ 

Hands-On Exercise 6.5.8 Prove part (b) of Theorem 6.5.2. 


A 

Whether a function /: A — > B is one-to-one or onto can be determined by the cardinality of 
the preimages. 

• / is one-to-one if and only if |/ -1 ({&})| < 1 for every b € B. 

• / is onto if and only if |/ -1 ({&})| > 1 for every b € B. 

If A and B are finite sets, then 

• | -4 1 < \B\ if / is one-to-one, and 

• |-4| > \B\ if / is onto. 

In particular, if / is one-to-one and onto, we have \A\ = \B\. 

Example 6.5.7 A function f:Z 14 — > Z 10 cannot be one-to-one because in order for it to be 
one-to-one, we need 14 distinct images. Since the codomain has only 10 elements, it is impossible 
for it to come up with 14 different images. 

Likewise, a function g: Z 23 Z 57 cannot be onto because the domain has 23 elements, hence, 

we can have at most 23 different images. But the codomain has 57 elements, therefore, some of 
its elements must be left unused. A 

Example 6.5.8 Consider the function h: Z 23 — > Z 57 defined by 

h(x) = 43x (mod 57). 

If y = 43a; (mod 57), then, since 43 -1 = 4 (mod 57), we find, in Z 23 , 

x = 43 _ 1 j/ = 4 y. 
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Be sure you know 
which modulus you 
are using. 


Since we can also express x in terms of y, we declare that / is onto. Yet, we have learned from 
the previous example that / cannot be onto. Is there any contradiction? 

Solution: There is an error in the argument. We should have said 

x = 43 _1 y = 4y (mod 57). 

Since x is reduced modulo 57, its value may exceed 23. If this happens, x ^ Z 23 . For example, 
if y = 11, we would have x = 44 ^ Z 23 . Even if we reduce 44 modulo 23, we obtain x = 21 
(mod 23), we would have 

43-21 = 48 ^11 (mod 57). 

So it is still not the correct preimage. This example again illustrates the importance of taking 
caution when a function involves different moduli in its domain and codomain. A 


Summary and Review 

• Given a function /: A -A B, the image of C C A is defined as f(C ) = {/(a;) | x £ C}. 

• If y £ /(C), then y € B, and there exists an x £ C such that f(x) = y. 

• See Theorem 6.5.1 for a list of properties of the image of a set. 

• The preimage of D C B is defined as f~ 1 (D) = {x G A \ f(x) € D}. 

• If x € f~ 1 (D), then x € A, and f(x) € D. 

• See Theorem 6.5.2 for a list of properties of the preimage of a set. 


Exercises 6.5 


1. For each of the following functions, find the image of C, and the preimage of D. 

(a) / 1 : {1,2, 3, 4, 5} -> {a, 6 ,c,d}; /i(l) = 6 , /i(2) = c, /i(3) = a, /i(4) = a, /i(5) = c; 

C = {1, 3}, D = {a.c}. 

(b) / 2 : {1,2, 3, 4} — > {a,b,c,d,e}; / 2 (1) = c, / 2 (2) = b, / 2 (3) = a, / 2 (4) = d; 

<7= {1,3}, D = {b,d}. 

(c) / 3 : {1,2, 3, 4, 5} {a, 6 ,c,d,e}; / 3 ( 1) = 6 , / 3 ( 2) = b , / 3 ( 3) = b , / 3 (4) = o, / 3 (5) = d; 

C = {1, 3, 5}, D = {c}. 

(d) / 4 : {1,2, 3, 4, 5} {a, 6 ,c,d,e}; / 4 (1) = d, / 4 (2) = 6 , / 4 (3) = e, / 4 (4) = a, / 4 (5) = c; 

C={3}, i? = {c}. 


2. For each of the following functions, find the image of C, and the preimage of D. 

; C = 2Z, D = N. 

2 n if n < 0 , 


(a) E'. Z — >• Z; 

/5W 

(b) fe: Z — > Z; 

/e(n) 

(c) /y:N^N; 

/r(n) 

(d) / 8 :N^N; 

/«(«) 

The function s: Z 

12 ~ > Z; 


—3 n if n > 0 ; 


C = N, D = 21. 


(n + l )/2 if n is odd, 
n /2 if n is even; 

n + 1 if n is odd, 
n — 1 if n is even; 


C = D = 2N. 

C = D = 2N. 


s(x) = 4x + 7 (mod 12). 


(a) Find s({2,5,7». 

(b) Find s _ 1 ({2, 5, 7}). 

(c) Find im s. 
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4. The function t : Z15 — x Z15 is defined as 

t(x) = 3x 2 — 5 (mod 15). 


(a) Find f({2, 3, 5, 13}). 

(b) Find t -1 ({l, 5, 7}). 

(c) Find im t. 

5. The function u: ffi. — x K. is defined as u(x) = 3x + 11, and the function v: Z — x K. is defined 
as v(x ) = 3x + 11. 

(a) Find u([3,5)) and n({3,4, 5}). 

(b) Find u _1 ((2, 7 ]) and v~ 1 ((2, 7 ]). 

6 . Is the function h: Z — x Z defined by 



if n > 0 
if n < 0 


one-to-one? Is it onto? 


7. Define the r:ZxZ->Q according to r(m, n) = 3 m 5". 

(a) Find r({l, 2, 3} x {—1, 0, 1}). 

(b) Find r" 1 

(c) Find r _1 (D), where D = {3, 9, 27, 81, . . . }. 

8. Define the function p: ZxZ-)Z according to p(x, y ) = 12x + 15 y. 

(a) Find p -1 ({18}). You may use the set-builder notation to describe your answer. 

(b) Find imp. 

9. The sum of the entries in a particular row in a matrix is called a row sum, and the sum of 
the entries in a particular column is called a column sum. Discuss how can we use the row 
sums and column sums of the incidence matrix of a function to determine if the function 
is well-defined, one-to-one, and onto. 


10. Below is the incidence matrix of the function /: {a, b , c, d, e } —X {a, /3, 7, 6 , e}: 


a 

b 

c 

d 

e 



a 

P 

7 

6 

e 


/ 

0 

0 

0 

0 

1 

\ 


0 

0 

0 

1 

0 



1 

0 

0 

0 

0 



0 

0 

1 

0 

0 


V 

1 

0 

0 

0 

0 

J 


(a) Find f({a,d,e}). 

(b) Find /3,e}). 

(c) Findim/. 

11. Consider the function hi defined in Problem 8a in Exercises 6.2. What is fi)” 1 ({m}), if m 
represents your mother? 


12. Let S denote the maternal family tree, that includes you, your mother, your maternal 
grandmother, your maternal great-grandmother, and so on. Define a function M:S —X S 
by letting M (x) be the mother of x. Determine imM. 

13. Prove part (c) of Theorem 6.5.1. 
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Pronounce f 1 as 
“f inverse.” 


14. Prove part (c) of Theorem 6.5.2. 

15. (a) Prove part (d) of Theorem 6.5.1. 

(b) Prove part (d) of Theorem 6.5.2. 

16. Construct an example of a function f:A -A B, and C \ , C '2 C A such that f(C\ — C 2 ) 2 
/(Ci) — /(C 2 ). See part (c) of Theorem 6.5.1. 

17. Given a function f:A—>B, and C C A, since /(C) is a subset of B, the preimage of this 
subset is indicated by the notation / -1 (/(C)). Consider the function /: Z — > Z defined by 
/( x) = x 2 , and C = {0, 1, 2, 3}. 

(a) Find /(C). 

(b) Find/" 1 (/(C)). 

18. Prove that C C / _1 (/(C)) for any function f-.A—^-B, and CCA. 

6.6 Inverse Functions 

A bijection is a function that is both one-to-one and onto. Naturally, if a function is a bijection, 
we say that it is bijective. If a function /: A -A B is a bijection, we can define another function 
g that essentially reverses the assignment rule associated with /. Then, applying the function g 
to any element y from the codomain B , we are able to obtain an element x from the domain A 
such that f(x) = y. Let us refine this idea into a more concrete definition. 

Definition. Let /: A — ► B be a bijective function. Its inverse function is the function 
f~ x :B — ► A with the property that 


/ 1 (6) =a<$b = /(a). 

The notation /- 1 is pronounced as “/ inverse.” See Figure 6.6 for a pictorial view of an inverse 
function. 



Figure 6.6: The pictorial view of an inverse function. 

Why is / _1 : B — > A a well-defined function? For it to be well-defined, every element b C B 
must have a unique image. This means given any element b £ B, we must be able to find one 
and only one element a £ A such that f(a) = b. Such an a exists, because / is onto, and there 
is only one such element a because / is one-to-one. Therefore, /- 1 is a well-defined function. 

If a function / is defined by a computational rule, then the input value x and the output 
value y are related by the equation y = f(x). In an inverse function, the role of the input and 
output are switched. Therefore, we can find the inverse function /- 1 by following these steps: 

(i) Interchange the role of x and y in the equation y = f{x). That is, write x = f{y). 



6.6 Inverse Functions 


185 


(ii) Solve for y. That is, express y in terms of x. The resulting expression is / : (x). 

Be sure to write the final answer in the form / _1 (x) = . . . . Do not forget to include the domain 
and the codomain, and describe them properly. 

Example 6.6.1 To find the inverse function of /:R — > R defined by f(x) = 2x + 1, we start 
with the equation y = 2x + 1. Next, interchange x with y to obtain the new equation 

x = 2y + 1. 

Solving for y, we find y = \ {x — 1). Therefore, the inverse function is 

f~ 1 (x) = ^ (x - 1). 

It is important to describe the domain and the codomain, because they may not be the same as 
the original function. ▲ 

Example 6.6.2 The function s: [ — f , f ] — > [—1, 1] defined by s( x) = sins is a bijection. Its 
inverse function is 

s -1 : [—1, 1] — » [ — f j f] , s _1 (x) = arcsinx. 

The function arcsinx is also written as sin -1 x, which follows the same notation we use for 
inverse functions. ▲ 

Hands-On Exercise 6.6.1 The function /: [— 3, oo) — > [0, oo) is defined as /(x) = y/x + 3. 
Show that it is a bijection, and find its inverse function. 


A 


Hands-On Exercise 6.6.2 Find the inverse function of g: R — > (0, oo) defined by g(x) = e x . 


A 

Remark. Exercise caution with the notation. Assume the function f:Z — »• Z is a bijection. 
The notation / _1 ( 3) means the image of 3 under the inverse function / -1 . If f~ 1 ( 3) = 5, we 
know that /( 5) = 3. The notation / _1 ({3}) means the preimage of the set {3}. In this case, we 
find ,/ _1 ({3}) = {5}. The results are essentially the same if the function is bijective. 

If a function g: Z — > Z is many-to-one, then it does not have an inverse function. This makes 
the notation g _1 ( 3) meaningless. Nonetheless, g -1 ({3}) is well-defined, because it means the 
preimage of {3}. If <7 1 ({3}) = {1,2,5}, we know g( 1) = g( 2) = g( 5) = 3. 

In general, f~ 1 (D) means the preimage of the subset D under the function /. Here, the 
function / can be any function. If / is a bijection, then f^ 1 (D) can also mean the image of the 
subset D under the inverse function / . There is no confusion here, because the results are 

the same. <0> 


Example 6.6.3 The function /:! 


is defined as 


fix) = | 


3x 

2x A 1 


if x < 1, 
if x > 1. 


Find its inverse function. 
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Solution: Since / is a piecewise-defined function, we expect its inverse function to be piecewise- 
defined as well. First, we need to find the two ranges of input values in / -1 . The images for 
x < 1 are y < 3, and the images for x > 1 are y > 3. Hence, the codomain of /, which becomes 
the domain of f~\ is split into two halves at 3. The inverse function should look like 


f~\x) = { 


??? 

??? 


if x < 3, 
if x > 3. 


Next, we determine the formulas in the two ranges. We find 


r\x) 


| a: if x < 3, 

\{x — 1) if x > 3. 


The details are left to you as an exercise. 

Hands-On Exercise 6.6.3 Find the inverse function of g: 1 


defined by 


d(x) = { 


3a; + 5 if x < 6, 
5x — 7 if x > 6. 


Be sure you describe g 1 properly. 


A 

Example 6.6.4 The function g:Z w — » Z w is defined by g(x) = 7x + 2 (mod 10). Find its 
inverse function. 

Solution: From x = g(y) = 7y + 2 (mod 10), we obtain 

y = 7~ 1 (x — 2) = 3(x — 2) (mod 10). 

Hence, the inverse function g~ 1 :Z w — »• Z 10 is defined by g~ 1 (x) = 3(x — 2) (mod 10). A 

Hands-On Exercise 6.6.4 The function h: Z57 — > Z57 defined by h(x) = 49a: — 3 (mod 57). 
Find its inverse function. 


A 

Define h:Z w -A Z w according to h(x) = 2(x + 3) mod 10. Does h~ l exist? 

Solution: Since 2 _1 does not exist, we suspect the answer is no. In fact, h(x) is always even, 
and it is easy to verify that im h = {0, 2, 4, 6, 8}. Since h is not onto, h _1 does not exist. A 


Example 6.6.5 
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Example 6.6.6 Find the inverse function of f:Z — > NU {0} defined by 

,f(n) = { ; 


2 n if n > 0, 

. —2 n — 1 if n < 0. 


Solution: In an inverse function, the domain and the codomain are switched, so we have to 
start with / _1 : NU{0} — »• Z before we describe the formula that defines f~ 1 . Writing n = f(m), 
we find 

f 2m if m > 0, 

77 = s — 

l —2m — 1 if m < 0. 

We need to consider two cases. 


(i) If n = 2 to, then n is even, and to = ^. 

(ii) If n = —2m — 1, then n is odd, and m = — 
Therefore, the inverse function is defined by 


r 1 : NU{0}^Z, r\n) 


Y if n is even, 

— ^Y^ if n is odd. 


Verify this with some numeric examples. 


▲ 


Hands-On Exercise 6.6.5 The function f:Z — > N is defined as 


f —2 n if n < 0, 

\ 2n + 1 if n > 0. 


Find its inverse. 


A 

Let A and B be finite sets. If there exists a bijection /: A —> B, then the elements of A and 
B are in one-to-one correspondence via f. Hence, |H| = \B\. This idea provides the basis for 
some interesting proofs. 

Example 6.6.7 Let A = {a±, a, 2 , ■ ■ ■ , a n } be an n-element sets. Recall that the power set p(A) 
contains all the subsets of A, and 

{0, 1}" = {(6i, b- 2 , . . . , b n ) | bi € {0, 1} for each i , where 1 < * < n}. 

Define F: p(A) — >- {0, 1}™ according to F(S ) = (xi,X 2 , ■ ■ ■ , x n ), where 


_ f 1 if a, ; G S, 

1 \ 0 if a, S. 

Simply put, F(S) is an ordered n-tuple whose ?'th entry is either 1 or 0, indicating whether S 
contains the ith element of A (1 for yes, and 0 for no). 

It is clear that F is a bijection. For n = 8, we have, for example, 


F({a 2 , a 5 , a 8 }) = (0, 1, 0, 0, 1, 0, 0, 1), 
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and 

F _1 ((l, 1,0, 0,0, 1, 1,0)) = {ai, a 2 , a 6 , a 7 }. 

The function F defines a one-to-one correspondence between the subsets of A and the ordered 
n-tuples in {0, 1}". Since there are two choices for each entry in these ordered n-tuples, we have 
2" such ordered ?r-tuples. This proves that |p(A)| = 2™, that is, A has 2" subsets. ▲ 

Hands-On Exercise 6.6.6 Consider the function F defined in Example 6.6.7. Assume n = 8. 
Find F(0) and F _1 ((l, 0, 1, 1, 1, 0, 0, 0)) . 


A 


Summary and Review 

• A Injection is a function that is both one-to-one and onto. 

• The inverse of a Injection /: A -A B is the function f~ 1 : B -A A with the property that 

f(x) =y x = 

• In brief, an inverse function reverses the assignment rule of /. It starts with an element y 
in the codomain of /, and recovers the element x in the domain of / such that f(x) = y. 


Exercises 6.6 

1. Which of the following functions are bijections? Explain! 


(a) 

/ 

R — > ffi, f(x) = x 

3 - 2x s 

+ i. 

(b) 

9 

[2, oo ) — > R, g(x) 

= x 3 - 

- 2x 2 + 1 

(c) 

h 

R — > R, h(x) = e ] 

—2x 


(d) 

P 

R — > R, p(x) = 1 

— 3x\. 


(e) 

q 

[2, oo ) ->• [0, oo), 

q(x) = 

\/x — 2. 


2. For those functions that are not bijections in the last problem, can we modify their 
codomains to change them into bijections? 

3. Let / and g be the functions from (1, 3) to (4, 7) defined by 

, 3 5 , , , 3 17 

f( x ) = - x +-, and g{x) = -- x+ — . 

Find their inverse functions. Be sure to describe their domains and codomains. 

4. Find the inverse function /: R — >• K. defined by 

, v , f 3x + 5 if x < 6, 

/(l) = l5*-7 if*; 6. 

Be sure you describe f^ 1 correctly and properly. 

5. The function g: [1,3] —> [4, 7] is defined according to 

^ x) = \ll-2x if 2 < £ < 3. 


Find its inverse function. Be sure you describe it correctly and properly. 
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6. Find the inverse of the function r: (0, oo) — > R defined by r{x) = 4 + 3 In x. 

7. Find the inverse of the function s: R — ► (— oo, —3) defined by s(x) = 4 — 7e 2x . 

8 . Find the inverse of each of the following bijections. 

(a) h: {1, 2, 3, 4, 5} — > {a, b, c, d, e}, h{ 1) = e, h{ 2) = c, h( 3) = b , h{ 4) = a, /i(5) = d. 

(b) fc: {1,2, 3, 4, 5} -)> {1,2, 3, 4, 5}, fc(l) = 3, k( 2) = 1, fc(3) = 5, fc(4) = 4, k( 5) = 2. 

9. Find the inverse of each of the following bijections. 

(a) u: Q — > Q, u(x) = 3x — 2. 

(b) r,:Q-{l}-MJ-{2}, v(*) = ^. 

(c) w:Z— >• Z, w(n)=n + 3. 

10. Find the inverse of each of the following bijections. 

(a) r:Zi 2 — ► Z 12 , r(n ) = 7n (mod 12). 

(b) s:Z 33 — > Z 33 , s(n) = 7n + 5 (mod 33). 

(c) t:K — > N u {()}, <(n) = { 2 _“; 1 

11. The images of the bijection a: {1, 2, 3, 4, 5, 6, 7, 8} -» {a, 6, c, d, e, /, g, h} are given below. 


X 

1 

2 

3 

4 

5 

6 

7 

8 

a{x) 

9 

a 

d 

h 

b 

e 

/ 

c 


Find its inverse function. 


12. Below is the incidence matrix for the bijection [i: {a, b, c, d, e, /} — > {x, y, z, u, v, u>}. 


Find its inverse function. 


a 

b 


c 

d 


e 

f 


u v w x y z 

/ 0 1 0 0 0 0 \ 

1 0 0 0 0 0 

0 0 0 0 1 0 

0 0 1 0 0 0 

0 0 0 0 0 1 

\000100 / 
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Given functions f:A—¥B and g: B —$■ C, the composite function , go f , which is pronounced 
as “g circle /”, is defined as 


gof:A->C, {go f)(x)= g(f{x)). 


The image is obtained in two steps. First, f{x) is obtained. Next, it is passed to g to obtain 
the final result. It works like connecting two machines to form a bigger one, see Figure 6.7. We 
can also use an arrow diagram to provide another pictorial view, see Figure 6.8. 

Numeric value of ( go f){x ) can be computed in two steps. For example, to compute (<7°/)(5), 
we first compute the value of /(5), and then the value of g{f{ 5)). To find the algebraic description 
of ( g o f){x), we need to compute and simplify the formula for g{f{x)). In this case, it is 
often easier to start from the “outside” function. More precisely, start with g , and write the 


Write ( g o f)(x) 
instead of g o f(x). 
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X 



9(f(*)) 


X 

1 



s(/(®)) 


Figure 6.7: A composite function, viewed as input-output machines. 



Figure 6.8: Another pictorial view of a composite function. 


intermediate answer in terms of f(x), then substitute in the definition of f(x) and simplify the 
result. 


Example 6.7.1 Assume /, g:R — > K are defined as f(x) = x 2 , and g(x) = 3x + 1. We find 

(s ° f)(x) = g{f{x)) = 3[/(x)] + 1 = 3a; 2 + 1, 

(/ ° 9 )(x) = f{g(x)) = [g(x)) 2 = (3x + l) 2 . 

Therefore, 

gof:R^R, (g o f)(x) = 3x 2 + 1, 

fog-.R^-R, (/ ° g){x) = (3a; + l) 2 . 

We note that, in general, / ° g ^ g ° /■ A 

Hands-On Exercise 6.7.1 If p, q:R -A K are defined as p(x) = 2x + 5, and q(x) = x 2 + 1, 
determine po q and q o p. Do not forget to describe the domain and the codomain. 


A 


Hands-On Exercise 6.7.2 The functions /, <?:Z 12 — > Z 12 are defined by 


f(x) = lx + 2 (mod 12) 


and g(x) = 5x — 3 (mod 12). 
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Compute the composite function / o g. 


A 


Example 6.7.2 Define /, g: K. — ► ffi. as 

. , , _ f 3x + 1 if x < 0, 
nx) ~\2x + 5 if re > 0, 

and g(x) = 5x — 7. Find go f . 


Solution: Since / is a piecewise-defined function, we expect the composite function go f is also 
a piecewise-defined function. It is defined by 

(9 o f)(x) = g(f(x)) = 5/(z) - 7 = | I ] ^ J 

After simplification, we find 


gof:R^R, (g°f)(x) 


15x - 2 if re < 0, 
10a’ + 18 if x > 0. 


In this example, it is rather obvious what the domain and codomain 
always a good practice to include them when we describe a function. 


Hands-On Exercise 6.7.3 The functions f:\ 


f (x) =3x + 2, 


and 


9 Or) = 


and g: 

..2 


are 


x“ if x < 
2x — 1 if re > 


are. Nevertheless, it is 

▲ 


defined by 

5, 

5. 


Determine fog. 


A 


The next example further illustrates why it is often easier to start with the outside function 
g in the derivation of the formula for g{f{x)). 


Example 6.7.3 The function p: [1, 5] — > K is defined by 


and the function q: R — ► R by 


p{x) = 


<7 Or) 


2x + 3 

if 1 < x < 3 

5a; - 2 

if 3 < x < 5 

f 4x 

if x < 7, 

~ \3x 

if x > 7. 


Describe the function go p. 
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Solution: Since 


(qop)(x) = q(p(x)) 


4 p(x) if p(x) < 7, 
3 p(x) if p(x) > 7, 


we have to find out when will p(x) < 7, and when will p(x) > 7, because these conditions 
determine what we need to do next to continue the computation. Since p(x) is computed in two 
different ways, we have to analyze two cases. 


• Case 1: 1 < x < 3. In this case, p(x) is defined as 2x + 3. This is an increasing function, 
hence, 

p(x) > p{ 1) = 2-14-3 = 5, and p(x) < p( 3) = 2 • 3 + 3 = 9. 

For some xs in this range, we have p(x) < 7, but for other x-values, we have p(x) > 7. 
We need to know the cut-off point. This happens when p[x) = 2x + 3 = 7, that is, when 
x = 2. This leads to two subcases. 


— Case la: When 1 < x < 2, we have p{x) = 2x + 3 < 7. Thus, 

q(p{x)) = 4 p{x) = 4(2x + 3) = 8x + 12. 


— Case lb: When 2 < x < 3, we have p(x) = 2x + 3 > 7. Thus, 

q(p(x)) = 3 p(x) = 3(2a; + 3) = 6x + 9. 


• Case 2: 3 < x < 5. In this case, p(x) is computed as 5x — 2. This is an increasing 
function, hence p(x) > p( 3) = 5 • 3 — 2 = 13. Since p(x) is always greater than 7, we find 

q{p{x)) = 3 p(x) = 3(5x — 2) = 15x — 6. 


Combining these cases, we determine that the composite function q op : [1, 5] — > R is defined by 


{qop)(x) 


8x + 12 if 1 < x < 2, 
6x + 9 if 2 < x < 3, 
15a: - 6 if 3 < x < 5. 


Study this example again to make sure that you understand it thoroughly. 


A 


Hands-On Exercise 6.7.4 The functions /, g:Z — > Z are defined by 


( n + 1 if n is even 
l n — 1 if n is odd 


( n + 3 if n is even 
l n — 7 if n is odd 


Determine fog. 


A 

Strictly speaking, g o f is well-defined if the codomain of / equals to the domain of g. It is 
clear that g o f is still well-defined if im / is a subset of the domain of g. Hence, if 

f:A^B, g:C^D, 

then g o f is well-defined if B C C , or more generally, im / C C. 
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Example 6.7.4 Let R* denote the set of nonzero real numbers. Suppose 

/:R*-»R, /( x) = l/x, 

g: R — > (0, oo), g{x) = 3x 2 + 11. 

Determine fog and g o f . Be sure to specify their domains and codomains. 

Solution: To compute fog, we start with g, whose domain is R. Hence, R is the domain of 
/ o g. The result from g is a number in (0,oo). The interval (0, oo) contains positive numbers 
only, so it is a subset of R*. Therefore, we can continue our computation with /, and the final 
result is a number in R. Hence, the codomain of / o g is R. The image is computed according 
to f(g( x)) = 1 /g(x) = l/(3a; 2 + 11). We are now ready to present our answer: 

/o 5 :R->R, (/ ° g)(x) = ^ 

In a similar manner, the composite function g o f: R* — » (0, oo) is defined as 

{9°f){x) = \+ 11 - 

x z 

Be sure you understand how we determine the domain and codomain of g o f. ▲ 

Hands-On Exercise 6.7.5 Let Z denote the set of integers. Determine hog, where 

g- Z->R, g(x) = a/R, 
h: R — » R, h( x) = {x — 5) 2 . 


Is g o h well-defined? Explain! 


A 


As usual, take extra caution with modular arithmetic. 

Example 6.7.5 Define /: Z 15 — > Z 23 and g: Z 23 —> Z 32 according to 

f(x) = 3x + 5 (mod 23), 
g(x) = 2x + 1 (mod 32). 

We may expect go f: Z15 — >• Z23 to be defined as 

(g o f){x) = 2{3x + 5) + 1 = 6x + 11) (mod 32). 

In particular, (g o /)( 8) = 59 = 27 (mod 32). 

If we perform the computation one step at a time, we find /( 8) = 29 = 6 (mod 23), from 
which we obtain 

(3 0 /)( 8) = g{f{ 8)) = 3(6) = 13 (mod 32). 
which is not what we have just found. Can you explain why? 

Solution: The source of the problem is the different moduli used in / and g. The composite 
function should be defined as 


(g o f)(x) = 2r + 1 (mod 32) 


where r = 3:r + 5 (mod 23) . 
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In a way, this definition forces us to carry out the computation in two steps. Consequently, we 
will obtain the correct answer ( g o /)( 8 ) = 13. A 


There is a closed connection between a bisection and its inverse function, from the perspective 
of composition. 

Theorem 6.7.1 For a bijective function f:A—¥B, 

f~ 1 of = i A , and fof~ 1 = i Bl 
where i A and i B denote the identity function on A and B, respectively. 

Proof: To prove that / -1 o/ = i A , we need to show that (/ -1 o/)(a) = a for all a € A. Assume 
/(a) = b. Then, because / -1 is the inverse function of /, we know that / _ 1 ( 6 ) = a. Therefore, 

(/ _1 ° f)(a) = / _ 1 (/(a)) = / _ 1 ( 6 ) = a, 

which is what we want to show. The proof of / o / -1 = i B proceeds in the exact same manner, 
and is omitted here. ■ 

Example 6.7.6 Show that the functions f,g:M. — > R. defined by f(x) = 2x + 1 and g{x) = 
2 (x — 1 ) are inverse functions of each other. 

Remark. The problem does not ask you to find the inverse function of / or the inverse function 
of g. Instead, the answers are given to you already. You job is to verify that the answers are 
indeed correct, that the functions are inverse functions of each other. < 0 > 


To verify that g is 
the inverse function of 
f, you just have to 
check whether both 
fog and g o f are 
identity functions. 


Solution: Form the two composite functions fog and go f, and check whether they both equal 
to the identity function: 

(/ ° 9){x) = f{g(x)) = 2 g(x) + 1 = 2 [\{x - 1 )] + 1 = x, 

(. 9 ° /)(+) = 9(f(x)) = b [f(x) - 1 ] = 5 [( 2 a; + 1 ) - 1 ] = x. 

We conclude that / and g are inverse functions of each other. A 


Hands-On Exercise 6.7.6 Verify that /:R — ► R + defined by f(x) = e x , and c/:R + R 
defined by g(x) = lira:, are inverse functions of each other. 


A 

Theorem 6.7.2 Suppose f: A — ► B and g: B — » C. Let i A and i B denote the identity function 
on A and B, respectively. We have the following results. 

(a) foi A = f and i B ° f = /• 

(b) If both f and g are one-to-one, then g o / is also one-to-one. 

(c) If both f and g are onto, then g o / is also onto. 

(d) If both f and g are bijective, then go f is also bijective. In fact, (go /) -1 = / -1 o + 1 . 

Proof: (a) To show that / o i A = f , we need to show that (/ o i A )(a) = /(a) for all a £ A. 

This follows from direct computation: 

{foi A ){a) = f{i A {a)) = /(«)• 


The proofs of i B o f = / and (b)-(d) are left as exercises. 
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Example 6.7.7 The converses of (b) and (c) in Theorem 6.7.2 are false, as demonstrated in 
the functions 

/: Z — > Z, /( x) = 2x , 

g:Z-»Z, g(x)=|_a:/2j. 

Here, g o / = * z , so g o / is one-to-one, and it is obvious that / is also one-to-one, but g is not 
one-to-one. It is easy to see that both g and go f are onto, but / is not. ▲ 

Summary and Review 

• The composition of two functions f: A — »• B and g: B — > C is the function g o /: H » C 
defined by ( g o f)(x) = g{f{x)). 

• If f: A — ¥ B is bijective, then / _1 o f = i A and / o / _1 = i B . 

• To check whether f: A ^ B and g: B — ► H are inverse of each other, we need to show that 

— (go f )(x) = q( f(x)) = x for all x € A, and 

- (/ ° g)(y) = f{g{y )) = 2 / for all y &b. 

Exercises 6.7 

1. The functions g, f: R — > K. are defined by f(x) = 5x — 1 and g(x) = 3x 2 + 4. Determine 
fog and g o /. 

2. The function h: (0, oo) — > (0, oo) is defined by h(x) = x + Determine h o h. Simplify 
your answer as much as possible. 

3. The functions g,f:M. — > R are defined by f(x) = 1 — 3a; and g(x) = x 2 + 1. Evaluate 
/(5(/(0))). 

4. The functions p: (2, 8] — >• R. and q: R — > R are defined by 

f 3a; — 1 if 2 < x < 4, 

\ 17 - 2a; if 4 < x < 8, 

f 4a; — 1 if x < 3, 

( 3x + 1 if x > 3. 

Evaluate q o p. 

5. Describe go/. 

(a) /: Z — > N, /(n) = n 2 + 1; g:N -MQ, g(n) = 

(b) /: R — t (0, 1), /(x) = l/(a; 2 + 1); g: (0, 1) ->• (0, 1), g(x) = 1 - x. 

(c) /:Q - {2} Q*, /(*) = l/(*-2); g:Q*-MQ*, g(x) = 1/x. 

(d) /: R — >• [ 1, oo), f(x)=x 2 + 1; g: [ 1, oo) -4 [0, oo) g(x) = yjx-l. 

(e) /:Q-{ 10/3} {3}, /(*) = 3x - 7; g: Q - {3} ^ Q - {2}, g(ar) = 

2x/ (x — 3). 

6. Describe go/. 

(a) /: Z — ► Z 5 , /(n) = n (mod 5); g: Z 5 — > Z 5 , g(n) = n + 1 (mod 5). 

(b) /: Z§ — > Z 12 , /(n) = 3n (mod 12); g: Z 12 — ► Zq, g(n) = 2 n (mod 6). 

7. Describe go/. 

(a) /: {1, 2, 3, 4, 5} -> {1, 2, 3, 4, 5}, /(l) = 5, /( 2) = 3, /( 3) = 2, /( 4) = 1, /(5) = 4; 

g: {1, 2, 3, 4, 5} ^ {1, 2, 3, 4, 5}; g(l) = 3, g( 2) = 1, g(3) = 5, g(4) = 4, g(5) = 2 


p{x) 

q{x) 
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(b) f:{a,b,c,d,e} -> { 1,2, 3, 4, 5}; 
g-{ 1,2, 3, 4, 5} {a, b,c, d, e}; 


/(a) = 5, f(b) = 1, /(c) = 2, /(d) = 4, /(e) = 3; 
5(1) = e > 5( 2 ) = d, g{ 3) = a, 5(4) = c, y(5) = 6 


8 . Verify that /, </: R — » R defined by 


f 11 — 2a: if x < 4 
\ 15 — 3a: if a; > 4 


are inverse to each other. 


and g(x) 


| (15 — a;) if x < 3 
|(11 — x) if x > 3 


9. The functions /, g: Z — > Z are defined by 


f(n) = { 


2 n- 1 
2 n 


if n > 0 
if n < 0 


and 


( n + 1 if n is even 
l 3n if n is odd 


Determine g o /. 

10. Define the functions / and g on your maternal family tree (see Problem 8 in Exercises 6.2) 
according to 


f{x) = the mother of x, 

g(x) = the eldest daughter of the mother of x. 


Describe these functions. 

(a) / o g 
(c) f of 


(b ) go f 
(d) g o g 


11. Given the bijections / and g, find / ° g, (/ ° g) 1 and g 1 o / 


(a) / 

(b) / 

(c) / 

(d) / 


g: Z — > Z, 




^(n) =2 - n. 

5 “ 


Z — >• Z, /(n) = n + 1 

Q ->• Q, /(a) = 5a;; 

Q - {2} -> Q - {2}, /( a:)=3a:^4; 5 -: Q {2} 

Z 7 -A- Z 7 , /(n) = 2n + 5 (mod 7); g: Z 7 — >■ Z 7 , 


-aQ-{2}, g(x) = ^. 
g(ri) = 3n — 2 (mod 7). 

B and g: B — >■ C, such that 


12. Give an example of sets A, B , and C, and of functions /: A 
g o / and / are both one-to-one, but g is not one-to-one. 

13. Prove part (b) of Theorem 6.7.2. 

14. Prove part (c) of Theorem 6.7.2. 

15. Prove part (d) of Theorem 6.7.2. 

16. The incidence matrices for the functions /: {a, b , c, d , e} -A {a;, y, 2 , 11 ;} and g: {x, y, z, 11 ;} 
{1,2, 3, 4, 5, 6 } are 


a 

b 

c 

d 

e 


x y z w 

( 0 0 1 0 \ 

0 10 0 

0 0 10 
10 0 0 

\ 1 0 0 0 J 


and 




1 

2 

3 

4 

5 

6 

X 

( 

0 

0 

1 

0 

0 

° \ 

y 


0 

0 

0 

0 

0 

1 

z 


0 

1 

0 

0 

0 

0 

w 


1 

0 

0 

0 

0 

0 / 


respectively. Construct the incidence matrix for the composition g o /. 
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Relations 


7.1 Definition of Relations 


Given two nonempty sets A and B , a function tells us how to obtain a unique element b £ B 
from any element a € A. Very often, we are only interested in some sort of relationship between 
the elements from these two sets. A familiar example is the equality of two numbers. By saying 
a = b, we are proclaiming that the two numbers a and b are related by being equal in value. 
Likewise, a > b is another example of a relation. 

Example 7.1.1 Given a,b £ R*, declare a and b to be related if they have the same sign. For 

instance, 7.14 and e are related, so are —n and — \p2. However, 5 and —2 are not. Note that a 
is related to b implies that b is also related to a. A 

Example 7.1.2 For a,b £ R, define “a is related to 6” if and only if a < b. Take note that 
3 < 5, but 5 f 3. This demonstrates that a is related to b does not necessarily imply that b is 
also related to a. A 


Example 7.1.3 Let A be a set of students, and let B be a set of courses. Given a £ A and 
b € iTTclenne “a is related to 6” if and only if student a is taking course b. While it could be 
possible that “John Smith is related to MATH 210” because John is taking MATH 210, it is 
certainly absurd to say that “MATH 210 is related to John Smith,” because it does not make 
much sense to say that MATH 210 is taking John Smith. This again illustrates that a is related 
to b does not necessarily imply that b is also related to a. A 


In these examples, we see that when we say “a is related to 6,” the order in which a and b 
appear may make a difference. This suggests the following definition. 

Definition. A relation from a set A to a set B is a subset of A x B. Hence, a relation R 
consists of ordered pairs (a, b ), where a £ A and b £ B. If (a, b ) £ R , we say that a is related 
to b , and we also write aRb. <C> 


Remark. We can also replace R by a symbol, especially when one is readily available. This is 
exactly what we do in, for example, a < b. To say it is not true that a <b, we can write a -f. b. 
Likewise, if (a, b) (f R, then a is not related to b , and we could write a fib. But the slash may 
not be easy to recognize when it is written over an uppercase letter. In this regard, it may be 
a good practice to avoid using the slash notation over a letter. Alternatively, one may use the 
“bar” notation aRb to indicate that a and b are not related. <0> 

Example 7.1.4 Define R = {( a,b ) £ R 2 | a < b}, hence (a, b) £ R if and only if a < b. 
Obviously, saying “a < b” is much clearer than “a R 6.” If a and b are not related, we could say 
(a, b) f R, or a ft b. A 


A relationship can be 
one-way only. 


The two sets need 
not be the same. 


A relation is a set of 
ordered pairs, we also 
write aRb if a is 
related to b. 
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Example 7.1.5 Define 


F = { (x, y) G 



Therefore x is related to y if and only if y — We can also write 


F = 





which may look a bit simpler. 

For instance, (1, 0.5) € F, but (1, 0) ^ F. In this case, (2, 0.2) G F is probably easier to 
understand than 2F0.2. Likewise, (1,2) ^ F may be easier to read than 1^2. A 

Hands-On Exercise 7.1.1 Define the relation H as {( x,x 2 + 1) | x € M}. Determine whether 
the following statements 

2H3, (-4,17 )$H, (|,| )iH, (>/2,3 ) € H, (1,2 ) G H, 

are true or false. 


A 

Hands-On Exercise 7.1.2 Let G = {( x,y ) G K 2 | xy = 1}. Is 2 related to 0.5? How would 
you write it? Repeat with 4 and 0.5, and with 10 and 3. 


A 

Hands-On Exercise 7.1.3 In the last example, is 0 related to 3? How would you write it? 
Repeat with 1 and —1. Again with 4= and y/2. 


A 

Since a relation is a set, we can describe a relation by listing its elements (that is, using the 
roster method). 


Example 7.1.6 Let A = {1, 2, 3,4, 5, 6} and B = {1,2, 3,4}. Define (a, 6) G R if and only if 
(a — b ) mod 2 = 0. Then 


R = {(1, 1), (1, 3), (2, 2), (2, 4), (3, 1), (3, 3), (4, 2), (4,4), (5, 1), (5, 3), (6, 2), (6, 4)}. 


We note that R consists of ordered pairs (a, b) where a and b have the same parity. Be cautious, 
that 1 < a < 6 and 1 < b < 4. Hence, it is meaningless to talk about whether (1,5) G R or 
(1,5 )<£ R . A 


Hands-On Exercise 7.1.4 Let A = {2,3,4, 7} and B = {1, 2, 3, . . . , 12}. Define a S b if and 
only if a \ b. Use the roster method to describe S. 


A 
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In the last example, 7 never appears as the first element (in the first coordinate) of any 
ordered pair. Likewise, 1, 5, 7, and 11 never appear as the second element (in the second 
coordinate) of any ordered pair. 

Definition. The domain of a relation R C A x B is defined as 

clomi? = {a € A \ (a,b) € R for some b € B}, 
and the image or range is defined as 

imiZ = {b € B | (a, b) € R for some a € A}. 

Hands-On Exercise 7.1.5 Find domS and im S, where S in Hands-On Exercise 7.1.4. 


A 

A relation R C A x B can be displayed graphically on a digraph which is also called a 
directed graph. Represent the elements from A and B by vertices or dots , and use directed 
lines (also called directed edges or arcs) to connect two vertices if the corresponding elements 
are related. Figure 7.1 displays a graphical representation of the relation in Example 7.1.6. 

12 3 4 

B 


A 

1 2 3 4 5 6 



Figure 7.1: The graphical representation of the a relation. 


Although a digraph gives us a clear and precise visual representation of a relation, it could 
become very confusing and hard to read when the relation contains many ordered pairs. As 
we will see in Section 7.4, we can sometimes simplify the digraphs in some special situations. 
Otherwise, the graphical representation is only effective for relations with a small number of 
ordered pairs. 

We can use a matrix representation to describe a relation. A matrix consists of values 
arranged in rows and columns. A relation R from A = {cti, . . . , a m } to B = {6j_ , . . . , b n } can be 
described by an m-by-n matrix M = ( mij ) whose entry at row i and column j is defined by 


rriij 


1 if a,; R bj , 
0 otherwise. 


The matrix M is called the incidence matrix for R. 


Example 7.1.7 The incidence matrix for the relation R in Example 7.1.6 is 


1 

2 

3 

4 

5 

6 


12 3 4 

/ 1 0 1 0 \ 
0 10 1 
10 10 
0 10 1 
10 10 
\0 1 0 1 / 


in which we label the rows and columns with the elements involved in the relation. 


▲ 
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Hands-On Exercise 7.1.6 Determine the incidence matrix for the relation S in Hands-On 
Exercise 7.1.4. 


A 

Hands-On Exercise 7.1.7 The courses taken by John, Mary, Paul, and Sally are listed below. 

John: MATH 210, CSIT 121, MATH 223 

Mary: MATH 231, CSIT 121, MATH 210 
Paul: CSIT 120, MATH 231, MATH 223 

Sally: MATH 210, CSIT 120 

Represent, using a graph and a matrix, the relation R defined as aRb if student a is taking 
course b. 


A 

Summary and Review 

• Relations are generalizations of functions. A relation merely states that the elements from 
two sets A and B are related in a certain way. 

• More formally, a relation is defined as a subset of Ax B. 

• The domain of a relation is the set of elements in A that appear in the first coordinates 
of some ordered pairs, and the image or range is the set of elements in B that appear in 
the second coordinates of some ordered pairs. 

• For brevity and for clarity, we often write xRy if (x,y) £ R. 

• Under this convention, the mathematical notations <, >, =, C, and their like, can be 
regarded as relational operators. 

Exercises 7.1 

1. Represent each of the following relations from {1, 2, 3, 6} to {1, 2, 3, 6} using a digraph and 
an incidence matrix. 

(a) {(x,y) I x = y} (b) {(x, y) \ x £ y} (c) {{x, y) \ x < y} 

2. Find the domain and image of each relation in Problem 1. 

3. Represent each of the following relations from {1, 2, 3, 6} to {1, 2, 3, 6} using a digraph and 
an incidence matrix. 


(a) {(x,y) | x 2 <y} 


(d) {(x,y) | x divides y} 


(c) {(x, y) | x + y is even} 
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4. Find the domain and image of each relation in Problem 3. 

5. Find the incidence matrix for each of the following relations from {1, 2, 3, 4} to {1, 2, 3, 4, 5}. 

(a) R={( 1,1), (2, 2), (2, 3), (3, 3), (3, 4), (4, 5)} 

(b) S = {(1, 1), (1, 2), (2, 2), (2, 3), (3, 3), (3, 4), (4, 4)} 

(c) T = {(1,5), (2, 4), (3, 3), (4,1), (4, 4)} 

6. Determine the incidence matrix and the digraph that represent the relation R defined on 
{x € Z \ —3 < x < 3} by 

xRy <t=> 3 | (x — y). 

7. Determine the incidence matrix and the digraph that represent the relation S defined on 
{1,2,4,5,10,20} by 

x S y <t=> (x < y and x divides y). 

8. Let D = {1, 2, 3, ... , 30} be the set of dates in November, and let W = {Sunday, Monday, 
Tuesday, Wednesday, Thursday, Friday, Saturday} be the set of days of the week. For 
November of this year, define the relation T from D to W by 

(x, y) £ T <t=> x falls on y. 

List the ordered pairs in T. Is T a function from T to W? 

9. Find the incidence matrix for the relation I C p({l, 2}) x p({l, 2}), where 

(5,T)e/»5nr/0. 

10. For a relation R C A x A, instead of using two rows of vertices in a digraph, we can 
use a digraph on the vertices that represent the elements of A. Hence, it is possible to 
have two directed arcs between a pair of vertices, and a loop may appear around a vertex 
x if ( x,x ) £ R. Find the incidence matrix for the relation represented by the following 
digraph: 

d 


a 


7.2 Properties of Relations 

If R is a relation from A to A, then R C A x A; we say that I? is a relation on A. 

Definition. A relation I? on A is said to be 

• reflexive if (a, a) £ Ra for all a £ A, 

• irreflexive if (a, a) ^ R for all a £ A, 

• symmetric if (a, b) £ R => (b, a) £ R for all a,b £ A, 

• antisymmetric if [(a, b) £ R A (6, a) £ R] =>■ a = b for all a,b £ A, 

• transitive if [(a, b) £ R A (6, c) € R\ => (a, c) £ R for all a, 6, c £ A. 

These are important definitions, so let us repeat them using the relational notation a Rb: 





202 


Chapter 7 Relations 


• reflexive if aRa for all a £ A, 

• irreflexive if a ft a (that is, aRa) for all a € A, 

• symmetric if aRb => b R a for all a, b £ A, 

• antisymmetric if [(aRb) A (bRa)] a = b for all a, b £ A, 

• transitive if [(a Rb) A (b Re)] => a Rc for all a,b,c£ A. <0> 

Remark. A relation cannot be both reflexive and irreflexive. Hence, these two properties are 
mutually exclusive. If it is reflexive, then it is not irreflexive. If it is irreflexive, then it cannot 
be reflexive. Nonetheless, it is possible for a relation to be neither reflexive nor irreflexive. <0> 

Remark. Many students find the concept of symmetry and antisymmetry confusing. Even 
though the name may suggest so, antisymmetry is not the opposite of symmetry. It is possible 
for a relation to be both symmetric and antisymmetric, and it is also possible for a relation to 
be both non-symmetric and non-antisymmetric. A good way to understand antisymmetry is to 
look at its contrapositive: 

a ^ b => (a, b) £ R A (b, a) £ R. 

Thus, if two distinct elements a and b are related (not every pair of elements need to be related), 
then either a is related to b , or b is related to a, but not both. Consequently, if we find distinct 
elements a and b such that (a, b) £ R and (6, a) £ R, then R is not antisymmetric. <0> 

Example 7.2.1 The empty relation is the subset 0. It is clearly irreflexive, hence not 
reflexive. To check symmetry, we want to know whether aRb => bRa for all a,b £ A. More 
specifically, we want to know whether (a, b) £ 0 =>• (b, a) £ 0. Since (a, b) £ 0 is always false, the 
implication is always true. Thus the relation is symmetric. Likewise, it is antisymmetric and 
transitive. 

The complete relation is the entire set A x A. It is clearly reflexive, hence not irreflexive. 
It is also trivial that it is symmetric and transitive. It is not antisymmetric unless |A| = 1. 

The identity relation consists of ordered pairs of the form (a, a), where a £ A. In other 
words, aRbii and only if a = b. It is reflexive (hence not irreflexive), symmetric, antisymmetric, 
and transitive. A 


Example 7.2.2 Consider the relation R on the set A = {1, 2, 3, 4} defined by 

R= {(1,1), (2, 3), (2, 4), (3, 3), (3, 4)}. 

• Since (2,2) ^ R, and (1, 1) £ R, the relation is neither reflexive nor irreflexive. 

• We have (2, 3) £ R but (3, 2) ^ R, thus R is not symmetric. 

• For any a ^ 6, only one of the four possibilities (a, b) ^ R , (b, a) ^ R , (a, b) £ R, or 
( b , a) £ R can occur, so R is antisymmetric. 

• By going through all the ordered pairs in R , we verify that whether (a, b) £ R and (6, c) £ 
R, we always have (a, c) £ R as well. This shows that R is transitive. 

Therefore, R is antisymmetric and transitive. A 

Example 7.2.3 Define the relation S on the set A = {1, 2, 3,4} according to 

5 = {(2,3), (3, 2)}. 

• Since (1, 1), (2, 2), (3, 3), (4, 4) ^ S, the relation S is irreflexive, hence, it is not reflexive. 

• Since we have only two ordered pairs, and it is clear that whenever (a, b) £ S , we also have 
( b , a) £ S. Hence, S is symmetric. 

• We have both (2,3) £ S and (3,2) £ S, but 2^3. Hence, S is not antisymmetric. 

• Since (2, 3 ) £ S and (3, 2) £ 5, but (2, 2) ^ S, the relation S is not transitive. 

We conclude that S is irreflexive and symmetric. A 
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Hands-On Exercise 7.2.1 Define the relation R on the set R as 

aRb <t=> a < b. 

Determine whether R is reflexive, irreflexive, symmetric, antisymmetric, or transitive. 


Hands-On Exercise 7.2.2 The relation S on the set R* is defined as 

aSb <t=> ab > 0. 

Determine whether S is reflexive, irreflexive, symmetric, antisymmetric, or transitive. 


A 


A 

Example 7.2.4 Here are two examples from geometry. Let T be the set of triangles that can 
be drawn on a plane. Define a relation S on T such that (Ti,T 2 ) £ S if and only if the two 
triangles are similar. It is easy to check that S is reflexive, symmetric, and transitive. 

Let £ be the set of all the (straight) lines on a plane. Define a relation P on £ according to 
(Li,L 2 ) £ P if and only if L\ and L 2 are parallel lines. Again, it is obvious that P is reflexive, 
symmetric, and transitive. ▲ 

Example 7.2.5 The relation T on R* is defined as 

aTb 7 eQ. 
b 

• Since ^ = 1 £ Q, the relation T is reflexive; it follows that T is not irreflexive. 

• The relation T is symmetric, because if | can be written as — for some integers m and n, 
then so is its reciprocal - . because - = A- 

• Since y/2 TV 18 and Ty/2, yet ^ ^18, we conclude that T is not antisymmetric. 

• If f > c £ Q, then | = ™ and | | for some nonzero integers m, n, p , and q. Then 

5 = |.| = ^£ e Q. Hence, T is transitive. 

Therefore, the relation T is reflexive, symmetric, and transitive. ▲ 

Hands-On Exercise 7.2.3 Consider the relation T on N defined by 

aTb <t=> a \ b. 
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Determine whether T is reflexive, irreflexive, symmetric, antisymmetric, or transitive. 


Hands-On Exercise 7.2.4 The relation U on the set Z* is defined as 

aUb +> a | b. 

Determine whether U is reflexive, irreflexive, symmetric, antisymmetric, or transitive. 


A 


A 

Example 7.2.6 The relation U on Z is defined as 

aU b +> 5 | (a + b). 

• The relation U is not reflexive, because 5 j (1 + 1). 

• It is not irreflexive either, because 5 | (10 + 10). 

• If 5 | (a + 6), it is obvious that 5 | (b + a) because a + b = b + a. Thus, U is symmetric. 

• We claim that U is not antisymmetric. For example, 5 | (2 + 3) and 5 | (3 + 2), yet 2^3. 

• It is not transitive either. For instance, 5 | (1 + 4) and 5 | (4 + 6), but 5 \ (1 + 6). 

The relation U is symmetric. ▲ 

Hands-On Exercise 7.2.5 Determine whether the following relation V on some universal set 
IA is reflexive, irreflexive, symmetric, antisymmetric, or transitive: 

(S,T) € V +> S C T. 


A 
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Example 7.2.7 Consider the relation V on the set A — {0, 1} is defined according to 

V = {(0,0), (1,1)}. 

• The relation V is reflexive, because (0,0) € V and (1, 1) £ V. Hence, it is not irreflcxive. 

• It is clearly symmetric, because (a, b) G V always implies (b, o) € V. 

• Indeed, whenever (a, b ) € V, we must also have a = b, because V consists of only two 
ordered pairs, both of them are in the form of (a, a). It follows that V is also antisymmetric. 

• A similar argument shows that V is transitive. 

The relation is reflexive, symmetric, antisymmetric, and transitive. ▲ 

Determine whether the following relation W on a nonempty set of 
is reflexive, irreflexive, symmetric, antisymmetric, or transitive: 

W b a and b have the same last name. 


Hands-On Exercise 7.2.6 

individuals in a community 

a 


A 

Example 7.2.8 Define the relation IP on a nonempty set of individuals in a community as 

aWb <t=> a is a child of b. 

• Nobody can be a child of himself or herself, hence, W cannot be reflexive. Instead, it is 
irreflexive. 

• It is obvious that W cannot be symmetric. 

• It may sound weird from the definition that W is antisymmetric: 

(a is a child of b) A (b is a child of a) => a = b, (7.1) 

but it is true! The reason is, if a is a child of 6, then b cannot be a child of a. This makes 
conjunction 

(a is a child of b) A (5 is a child of a) 
false, which makes the implication (7.1) true. 

• A similar argument holds if b is a child of a, and if neither a is a child of b nor b is a 
child of a. No matter what happens, the implication (7.1) is always true. Therefore W is 
antisymmetric. 

• It may help if we look at antisymmetry from a different angle. The contrapositive of the 
original definition asserts that when a ^ b 1 three things could happen: 

(i) a and b are incomparable (aW b and bW a), that is, a and b are unrelated; 
and if a and b are related, then either 

(ii) aWb but b W a, or 

(iii) bW a but AWb. 






206 


Chapter 7 Relations 


Using this observation, it is easy to see why W is antisymmetric. 

• It is clear that W is not transitive. 

The relation is irreflexive and antisymmetric. A 

Instead of using two rows of vertices in the digraph that represents a relation on a set A, we 
can use just one set of vertices to represent the elements of A. A directed line connects vertex 
a to vertex b if and only if the element a is related to the element b. If b is also related to a, the 
two vertices will be joined by two directed lines, one in each direction. If a is related to itself, 
there is a loop around the vertex representing a. See Problem 10 in Exercises 7.1. 

From the graphical representation, we determine that the relation R is 

• Reflexive if there is a loop at every vertex of G. 

• Irreflexive if G is loopless. 

• Symmetric if every pair of vertices is connected by none or exactly two directed lines in 
opposite directions. 

• Antisymmetric if every pair of vertices is connected by none or exactly one directed line. 

• Transitive if for every unidirectional path joining three vertices a, 6, c, in that order, there 
is also a directed line joining a to c. 

The incidence matrix M = (rn l j) for a relation on A is a square matrix. We find that R is 

• Reflexive if every entry on the main diagonal of M is 1. 

• Irreflexive if every entry on the main diagonal of M is 0. 

• Symmetric if M is symmetric, that is, mtj = rriji whenever i ^ j. 

• Antisymmetric if i ^ j implies that at least one of and rriji is zero, that is, = 0. 

• Transitive if (M 2 )^ > 0 implies m,j > 0 whenever i ^ j. 

For instance, the incidence matrix for the identity relation consists of Is on the main diagonal, 
and Os everywhere else. This is called the identity matrix. If a relation R on A is both symmetric 
and antisymmetric, its off-diagonal entries are all zeros, so it is a subset of the identity relation. 

It is an interesting exercise to prove the test for transitivity. Apply it to Example 7.2.2 to 
see how it works. 

Summary and Review 

• A relation from a set A to itself is called a relation on A. 

• Given any relation R on a set A, we are interested in five properties that R may or may 
not have. 

• The relation R is said to be reflexive if every element is related to itself, that is, if xRx 
for every x £ A. 

• The relation R is said to be irreflexive if no element is related to itself, that is, if x ft x for 
every x € A. 

• The reflexive property and the irreflexive property are mutually exclusive, and it is possible 
for a relation to be neither reflexive nor irreflexive. 

• The relation R is said to be symmetric if the relation can go in both directions, that is, if 
xRy implies yRx for any x,y £ A. 

• The relation R is said to be antisymmetric if given any two distinct elements x and y , 
either (i) x and y are not related in any way, or (ii) if x and y are related, they can only 
be related in one direction. 

• A compact way to define antisymmetry is: if xRy and yRx , then we must have x = y. 

• Finally, a relation is said to be transitive if we can pass along the relation and relate two 
elements if they are related via a third element. 

• More precisely, R is transitive if xRy and y Rz implies that xRz. 
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Exercises 7.2 

1. For each relation in Problem 1 in Exercises 7.1, determine which of the five properties are 
satisfied. 

2. For each relation in Problem 3 in Exercises 7.1, determine which of the five properties are 
satisfied. 

3. For the relation in Problem 6 in Exercises 7.1, determine which of the five properties are 
satisfied. 

4. For the relation in Problem 7 in Exercises 7.1, determine which of the five properties are 
satisfied. 

5. For the relation in Problem 8 in Exercises 7.1, determine which of the five properties are 
satisfied. 

6. For the relation in Problem 9 in Exercises 7.1, determine which of the five properties are 
satisfied. 

7. Let S' be a nonempty set and define the relation A on p(S) by 

(i,y)e4»iny = f). 

It is clear that A is symmetric. 

(a) Explain why A is not reflexive. 

(b) Explain why A is not irreflexive. 

(c) Is A transitive? 

(d) Let S = {a, b , c}. Draw the directed graph for A 1 and find the incidence matrix that 
represents A. 

8. For each of these relations on N — {1}, determine which of the five properties are satisfied. 

(a) Ai = {(x,y) | x and y are relatively prime} 

(b) A 2 = {(x,y) | x and y are not relatively prime} 

9. For each of the following relations on N, determine which of the five properties are satisfied. 

(a) i?i = {(x,y) \ x divides y} 

(b) R 2 = {(x, y) | x + y is even} 

(c) R 3 = {(x,y) | xy is even} 

10. For each of the following relations on N, determine which of the five properties are satisfied. 

(a) Si = {(x,y) | y divides a:} 

(b) S 2 = {(x,y) | x + y is odd} 

(c) S 3 = {(x,y) | xy is odd} 

11. For each of the following relations on Z, determine which of the five properties are satisfied. 

(a) Ui = {(x,y) | x < y} 

(b) U 2 = {(x,y) | x-y is odd} 

(c) U 3 = {(a;, y) | 3 divides x + 2 y} 

12. For each of the following relations on Z, determine which of the five properties are satisfied. 

(a) Vi = {( x,y ) | xy > 0} 

(b) V 2 = {(x, y) | x - y is even} 

(c) V 3 = {(x, y) | x is a multiple of y} 
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7.3 Equivalence Relations 


Definition. A relation on a set A is an equivalence relation if it is reflexive, symmetric, 
and transitive. We often use the tilde notation a ~ b to denote an equivalence relation. <0> 

Example 7.3.1 The relations in Examples 7.2.4, 7.2.5, and 7.2.7, are equivalence relations, 
so are those in Hands-On Exercises 7.2.2 and 7.2.6. A 

Example 7.3.2 Define a relation ~ on Z by 

a ~ b <t=> a = b (mod 4). 

Verify that ~ is an equivalence relation. 

Solution: We need to check three properties: 

• It is obvious a = a (mod 4), hence a ~ a. The relation ~ is reflexive. 

• If a ~ b, then a = b (mod 4). It is clear that we also have b = a (mod 4). Hence, ~ is 
symmetric. 

• If a ~ b and b ~ c, then 


a = b (mod 4), and b = c (mod 4). 

It follows that a = c (mod 4). Thus a ~ c. This shows that ~ is transitive. 

Therefore, ~ is an equivalence relation. A 

Hands-On Exercise 7.3.1 Define a relation ~ on Z by 

a ~ b <t=> a = b (mod 6). 

Verify that ~ is an equivalence relation. 


A 


Hands-On Exercise 7.3.2 Let n > 2 be a positive integer. Define a relation ~ on Z by 

a ~ b 4$ a = b (mod n ) . 

Verify that ~ is an equivalence relation. 


A 
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Take a closer look at Example 7.3.2. All the integers having the same remainder when 
divided by 4 are related to each other. Define the sets 


[0] 

= {n € Z 

n mod 4 

0} 

= 4Z 

[1] 

= {n € Z 

1 mod 4 = 

1} 

= 1 + 4Z 

[2] 

= {n £ Z 

n mod 4 = 

2} 

= 2 + 4Z 

[3] 

= {n € Z 

n mod 4 = 

3} 

= 3 + 4Z 


It is clear that every integer belongs to exactly one of these four sets. Hence, 

Z = [0] U [1] U [2] U [3]. 

These four sets are pairwise disjoint, so Z is a disjoint union of these four sets. We say that 
{[0], [1], [2], [3]} is a partition of Z. 

Definition. A collection {Si, S 2 , . . . , S n } of nonempty subsets of S is said to be a partition 
of S if the subsets Si, S 2 , . . . , S n are pairwise disjoint (Si D Sj = 0 whenever i ^ j), and 

Si U S 2 U • • • U S„ = S. 

The subsets Si, S 2 , . . . , S n are called the parts or components of the partition. <C> 

Because of transitivity and symmetry, all the elements related to a fixed element must be 
related to each other. Thus, if we know one element in the group, we essentially know all its 
“relatives.” 

Definition. Let ~ be an equivalence relation on A. The set 

[a] = {x € A | x ~ a}. 

is called the equivalence class of a. <C> 

Example 7.3.3 In Example 7.2.4, each equivalence class of the relation S consists of all the 
triangles that are similar. Note that no triangle can belong to two different equivalence classes. 
This means that the equf valence classes are pairwise disjoint. 

In the same example, each equivalence class of the relation P consists of all the lines that 
are parallel. Again, take note that no line can belong to two different equivalence classes. Thus, 
the equivalence classes are pairwise disjoint. A 

Example 7.3.4 For the relation ~ on Z defined by a ~ 6 <t=> a = b (mod 4), there are 
four equivalence classes [0], [1], [2] and [3], and the set {[0], [1], [2], [3]} forms a partition of Z. 
Therefore, 

Z = [0] U [1] U [2] U [3], 

and the four components [0], [1], [2] and [3] are pairwise disjoint. A 

Hands-On Exercise 7.3.3 What are the equivalence classes of the relation ~ in Hands-On 
Exercise 7.3.1? 


A 

Hands-On Exercise 7.3.4 What are the equivalence classes of the relation ~ in in Hands-On 
Exercise 7.3.2? 


If you know one 
member in the gang, 
you know them all! 


A 
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Hands-On Exercise 7.3.5 For each of the equivalence relations mentioned in Example 7.3.1, 
determine its equivalence classes. 


A 

All the elements in the same equivalence class are related to each other. Therefore, the 
elements in [a] all share the same property that a enjoys, from the viewpoint of the relation 
~. In Example 7.3.4, the equivalence class [0] consists of elements that are multiples of 4. The 
equivalence class [1] consists of elements that, when divided by 4, leave 1 as the remainder, 
and similarly for the equivalence classes [2] and [3]. Because of the common bond between the 
elements in an equivalence class [a] , all these elements can be represented by any member within 
the equivalence class. This is the spirit behind the next theorem. 

Theorem 7.3.1 //~ is an equivalence relation on A, then a ~ b <t=> [a] = [£>] . 

Proof: We leave the proof as an exercise. ■ 

Every element in an 
equivalence class can 
be its representative. 

Example 7.3.5 Define ~ on a set of individuals in a community according to 

a ~ b 4$ a and b have the same last name. 

We have seen that ~ is an equivalence relation. Each equivalence class consists of all the 
individuals with the same last name in the community. Hence, for example, James Smith, Lucy 
Smith, and Peter Smith all belong to the same equivalence class. Any Smith can serve as its 
representative, so we can denote it as, for example, [Peter Smith]. ▲ 

Define ~ on M + according to 

x ~ y 44 x — y £ Z. 

Hence, two real numbers are related if and only if they have the same decimal parts. It is easy to 
verify that ~ is an equivalence relation, and each equivalence class [x] consists of all the positive 
real numbers having the same decimal parts as x has. Notice that 

K+ = [J [*]. 

cc6(0,l] 

which means that the equivalence classes [x], where x £ (0, 1], form a partition of R. ▲ 

Hands-On Exercise 7.3.6 Prove that the relation ~ in Example 7.3.6 is indeed an equivalence 
relation. 


Example 7.3.6 


One may regard equivalence classes as objects with many aliases. Every element in an 
equivalence class can serve as its representative. So we have to take extra care when we deal 
with equivalence classes. Do not be fooled by the representatives, and consider two apparently 
different equivalence classes to be distinct when in reality they may be identical. 


A 
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Hands-On Exercise 7.3.7 Define ~ on K according to 

x ~ y <£> x — y £ Z. 

Show that ~ is an equivalence class. True or false: —2.14 £ [5, 14]? Explain. 


A 

What makes equivalence relations so important is the following Fundamental Theorem 
on Equivalence Relations. 

Theorem 7.3.2 Given any equivalence relation on a nonempty set A, the set of equivalence 
classes forms a partition of A. Conversely , any partition {Ai,A 2 , . . . , A n } of a nonempty set A 
into a finite number of nonempty subsets induces an equivalence relation ~ on A, where a ~ b 
if and only if a,b £ Ai for some i (thus a and b belong to the same component). 

Proof: It is clear that A is the union of the equivalence classes induced by ~, so it remains to 
show that these equivalence classes are pairwise disjoint. Assume [a] (~l [6] ^ 0. Let x £ [a] n [ b\ . 
Then x £ [a] and x £ [6]. Having x £ [a] means x ~ a, and x £ [6] implies that x ~ b. Symmetry 
and transitivity imply that a ~ b. Theorem 7.3.1 assures that [a] = [6]. Therefore, if [a] ^ [6], 
then [a] (~l [6] = 0. This proves that the equivalence classes form a partition of A. 

Let A = A\ U Ai U • • • U A n be a partition of A , define the relation ~ on A according to 

x ~ y <£> x,y £ Ai for some i. 

It follows immediately from the definition that x ~ x, so the relation is reflexive. It is also clear 
that x ~ y implies y ~ x, hence, the relation is symmetric. Finally, if x ~ y and y ~ z, then 
x, y £ Ai for some i, and y, z £ Aj for some j. Since the A,s form a partition of A, the element 
y cannot belong to two components. This means i = j, hence, x, z £ Ai. This proves that ~ is 
transitive. Consequently, ~ is an equivalence relation. ■ 

The idea behind the theorem is rather simple. Each equivalence class consists of all the 
“relatives” from the same family, so obviously the set A can be divided into families (equivalence 
classes). These families do not share any common elements (hence pairwise disjoint), because 
Theorem 7.3.1 states that any two equivalence classes sharing some common elements must be 
identical. Therefore, the families form a partition of A. Conversely, given a partition V , we could 
define a relation that relates all members in the same component. This relation turns out to 
be an equivalence relation, with each component forming an equivalence class. This equivalence 
relation is referred to as the equivalence relation induced by V. 

Example 7.3.7 In Example 7.2.4, the relation S is an equivalence relation, and the equivalence 
classes are the sets of similar triangles, which form a partition of the set T ■ This means any 
triangle belongs to one and only one equivalence class. In other words, we can classify the 
triangles on a plane according to their three interior angles. 

The relation P in the same example is also an equivalence relation. Its equivalence classes 
are the sets of lines that are parallel. Every line on the plane belongs to exactly one equivalence 
class. Consequently, we can classify the lines on a plane by their slopes. A 

Example 7.3.8 Over Z*, define 

R 3 = {(to, n) | to, n £ Z* and mn > 0}. 
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It is not difficult to verify that R 3 is an equivalence relation. There are only two equivalence 
classes: [ 1 ] and [— 1 ], where [ 1 ] contains all the positive integers, and [— 1 ] all the negative 
integers. It is obvious that Z* = [1] U [— 1]. A 


As a line is the union 
of infinitely many 
points, a plane is the 
union of infinitely 
many parallel lines. 


Example 7.3.9 For each 6 6 i, define Lb to be the line in R 2 (which is also called the xy- 
plane) with equation y = 2x + b. Then C = {Lb \ b e R} is a partition of R 2 because given 
any point on R 2 , there is only one straight line with slope 2 that can pass through it. Such a 
partition induces an equivalence relation ~ defined by 

( p , q ) ~ (s, t) <t=> both (p, q) and (s, t) lie on Lb for some b. 


Thus, ( p , q) ~ (s, t) if and only if the two points ( p , q) and (s, t) he on the same straight line of 
slope 2. This means = 2. Therefore, we can restate the definition as 

(P, V) ~ (M) «=> q~t = 2 (p - s). 


For example, (1, 5) ~ (0, 3). In fact, [(1, 5)] corresponds to the line y = 2x + 3 or L 3 . Similarly, 
[(1, 1-25)] corresponds to the line y = 2x — 0.75 or L_ 0 . 75 . In general, Lb = [(0, b )]. A 


Hands-On Exercise 7.3.8 Consider the partition of R 2 (the cry-plane) 

R 2 = U 

beR 

where Lb is the line satisfying the equation y = 5cr + b. Determine the equivalence relation 
induced by this partition. 


A 

We have studied modular arithmetic extensively. In Hands-On Exercise 7.3.2, you have 
already proved the following result. 

Theorem 7.3.3 For any positive integer n > 2, the relation congruence modulo n is an equiv- 
alence relation on Z. 

We can now provide a more rigorous definition of Z„. 

Definition. Let n > 2 be an integer. The equivalence classes [0], [1], . . . , [n— 1] of the relation 
congruence modulo n are called the residue classes modulo n. The set 

Z„ = {[0], [1], . . . , [n - 1]} 

is called the set of residue classes modulo n. <0> 

Remark. We define two operations ® and 0 on the elements of Z n according to 

[a] 0 [b\ = [a + b] , and [a] © [6] = [ ab } . 

We will not go into the details, but we would like to remark that (Z ra ,©,©) forms an algebraic 
structure called ring. In practice, we seldom write Z„ = {[0], [1], ... , [n — 1]} because it is too 
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cumbersome. Instead, we just write = {0, 1, 2, . . . , n — 1}. However, what we really work 
with in Z n are the residue classes represented by the integers 0 through n — 1. <£> 

The incidence matrix of an equivalence relation exhibits a beautiful pattern. Conversely, by 
examining the incidence matrix of a relation, we can tell whether the relation is an equivalence 
relation. 

If we can rearrange the rows and columns of an incidence matrix so that the modified 
incidence matrix can be divided into blocks of submatrices containing entirely Is or entirely Os, 
such that the 1-submatrices lie on the diagonal, then the underlying relation Ft. is an equivalence 
relation. Here is the reason. Since the entries in each 1-submatrix are all Is, this means the 
corresponding elements are all related to each other. This is the notion of transitivity. Obviously, 
every element is related to itself. Since the 1-submatrices lie on the diagonal, the matrix, hence 
the relation, is symmetric. This proves that the underlying relation is an equivalence relation. 
Each equivalence class consists of all the elements that correspond to the row and columns in 
the same 1-matrix. 


Example 7.3.10 Let vl = {1,2,3, 4, 5} and define the relation R\ on A by 


Ri = {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3), (4, 4), (4,4), (5, 4), (5, 5)}. 


It is clear from the incidence matrix (we add lines to make the 0- and 1-submatrices more 
outstanding) 



1 

2 

3 

4 

5 

1 

( 1 

1 

1 

0 

0 \ 

2 

1 

1 

1 

0 

0 

3 

1 

1 

1 

0 

0 

4 

0 

0 

0 

1 

1 

5 

V 0 

0 

0 

1 

1 / 


that R\ is an equivalence relation and that it has two equivalence classes: [1] = [2] = [3] = 
{1, 2, 3}, and [4] = [5] = {4, 5}, such that A = [1] U [4], A 


Example 7.3.11 Let A = {a,b,c,d}. Define the relation R 2 on A by 

R 2 = {(a, a), (a, c), (6, b ), (6, d), (c, a), (c, c), (d, 6), (d, d)}. 
After rewriting the incidence matrix 


abed a c b d 


a 

l 1 0 1 0 \ 

a 

( 1 1 

0 

0 

b 

0101' 

c 

1 1 

0 0 

c 

1010 

b 

0 0 

1 1 

d 

^ 0 1 0 1 ) 

d 

^ 0 0 

1 1 


it becomes clear that R 2 is an equivalence relation, with [a] = [c] = {a, c}, and [b] = [d] = {b, d}, 
such that A = [a] U [6] . A 


Hands-On Exercise 7.3.9 The relation S defined on the set {1, 2, 3, 4, 5, 6} is known to be 


5* = {(1, 1), (1,4), (2,2), (2,5), (2,6), (3,3), 

(4, 1), (4, 4), (5, 2), (5, 5), (5, 6), (6, 2), (6, 5), (6, 6)}. 
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Show that S is an equivalence relation by studying its incidence matrix, and rewriting it if 
necessary. Determine the contents of its equivalence classes. 


A 

Example 7.3.12 Find the equivalence relation R induced by the partition 

7> = {{1},{3},{2,4,5,6}} 

of A = {1,2, 3, 4, 5, 6}. 

Solution: From the two 1-element equivalence classes {1} and {3}, we find two ordered pairs 
(1, 1) and (3,3) that belong to R. From the equivalence class {2,4, 5, 6}, any pair of elements 
produce an ordered pair that belongs to R. Therefore, 

R = {(1, 1), (3, 3), (2, 2), (2, 4), (2, 5), (2, 6), (4, 2), (4, 4), (4, 5), (4, 6), 

(5, 2), (5, 4), (5, 5), (5, 6), (6, 2), (6, 4), (6, 5), (6, 6)}. 

Alternatively, we can construct the incidence matrix 



1 

3 

2 

4 

5 

6 

1 

( 1 

0 

0 

0 

0 

0 \ 

3 

0 

1 

0 

0 

0 

0 

2 

0 

0 

1 

1 

1 

1 

4 

0 

0 

1 

1 

1 

1 

5 

0 

0 

1 

1 

1 

1 

6 

V o 

0 

1 

1 

1 

1 / 


from which the ordered pairs in R can be easily obtained. A 

Hands-On Exercise 7.3.10 Find the equivalence relation R induced by the partition 

V = {{a,d},{b,c,g},{e,f}} 

of A = {a, &, c, d, e, /, g} by listing all its ordered pairs (the roster method). 


A 


Summary and Review 

• A relation R on a set A is an equivalence relation if it is reflexive, symmetric, and transitive. 

• If R is an equivalence relation on the set A, its equivalence classes form a partition of A. 

• In each equivalence class, all the elements are related and every element in A belongs to 
one and only one equivalence class. 

• The relation R determines the membership in each equivalence class, and every element 
in the equivalence class can be used to represent that equivalence class. 

• In a sense, if you know one member within an equivalence class, you also know all the 
other elements in the equivalence class because they are all related according to R. 

• Conversely, given a partition of A , we can use it to define an equivalence relation by 
declaring two elements to be related if they belong to the same component in the partition. 
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Exercises 7.3 

1. Show that each of the following relations ~ on Z is an equivalence relation, and find its 
equivalence classes. 

(a) to ~ n <t=> \m — 3| = | n — 3| 

(b) m ~ n <t=> m +n is even 

2. Show that each of the following relations ~ on Z is an equivalence relation, and find its 
equivalence classes. 

(a) m ~ n <t=> 3 | (m + 2 n) 

(b) m ~ n <t=> 5 j (2r?z + 3n) 

3. Let T be a fixed subset of a nonempty set S. Define the relation ~ on p(S) by 

i~y«inr = ynT, 

Show that ~ is an equivalence relation. In particular, let S = {1,2, 3, 4} and T = {1,3}. 

(a) True or false: {1, 2, 4} ~ {1, 4, 5}? 

(b) How about {1,2,4} ~ {1,3,4}? 

(c) Find [{1,5}] 

(d) Describe [X] for any X G p(S). 

4. For each of the following relations ~ on K. x K, determine whether it is an equivalence 
relation. For those that are, describe geometrically the equivalence class [(a, b)]. 

(a) (*1,2/1) ~ (*2,2/2) ^ 2/i ~ x\ = y 2 - x\. 

(b) (xi, yi ) ~ (x 2 , j/2 ) ^ {xx - l) 2 +y\ = (x 2 - l) 2 + y\ 

5. For each of the following relations ~ on K x R, determine whether it is an equivalence 
relation. For those that are, describe geometrically the equivalence class [(a, 6)]. 

(a) (*1,2/1) ~ (*2, 2/2) xi + 1/2 = *2 + 2/i 

(b) (*1,2/1) ~ (*2, 2/2) ^ (*1 - x 2 )(2/i - 2/2) = 0 

6. For each of the following relations ~ on K x K, determine whether it is an equivalence 
relation. For those that are, describe geometrically the equivalence class [(a, 6)]. 

(a) (*1,2/1) ~ (£2,2/2) <=> |*i| + I2/1I = |* 2 | + 1 2/2 1 

(b) (*1,2/1) ~ (£2,2/2) ^ *12/1 = £22/2 

7. Define the relation ~ on Q by 

* ~ ?/ ^ 2(£ — y) € Z. 

Show that ~ is an equivalence relation. Describe the equivalence classes [0] and [ } ] . 

8. Define the relation ~ on Q by 

£ ~ 2/ ^ G Z ' 

Show that ~ is an equivalence relation. Describe the equivalence classes [0], [1] and [{]. 

9. Consider the following relation on {a, b, c, d, ej: 

R = {(a, a), (a, c), (a, e), (6, 6), ( b , d), (c, a), (c, c), (c, e), 

(d, 6), (d, d), (e, a), (e, c), (e, e)}. 

Show that it is an equivalence relation, and describe its equivalence classes. 

Hint: Use the matrix representation of the relation. 
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10. Each part below gives a partition of A = {a, 6, c, d, e, /, g}. Find the equivalence relation 
on A induced by the partition. 

(a) Vi = Ua,b},{c,d},{e,f},{g}} (b) V 2 = {{a, c, e, g}, {&, d, /}} 

(c) V 3 = {{a,b,d,e,f},{c,g}} (d) P 4 = {{a,b,c,d,e, f,g}} 

11. Let ~ be an equivalence relation on A. Prove that if a ~ b, then [a] = [6]. 

12. Let ~ be an equivalence relation on A. Prove that if [a] = [6], then a ~ b. 


7.4 Partial and Total Ordering 

Two special relations occur frequently in mathematics. Both have to do with some sort of 
ordering of the elements in a set. A branch of mathematics is devoted to their study. As you 
can tell from the brief discussion in this section, they cover many familiar concepts. 

Definition. A relation on a nonempty set A is called a partial ordering or a partial- order 
relation if it is reflexive, antisymmetric, and transitive. We often use A to denote a partial 
ordering, and called (A, A) a partially ordered set or a poset. 

Example 7.4.1 The usual “less than or equal to” relation on R , denoted <, is a perfect 
example of partial ordering. In fact, this is the reason why we adopt the notation A, as it 
reflects the similarities between the two symbols. A 

Example 7.4.2 Another classic example of partial ordering is the subset relation, denoted C, 
on p(S), where S is any set of elements. Observe that S can be empty, in which case p(0) = {0}, 
and (p(0), C) is obviously a partially ordered set. A 

Example 7.4.3 Another standard example of poset is (N, |). It is easy to verify that the 
“divides” relation over the natural numbers is a partial ordering. Can you explain why (Z*, |) 
is not a poset? A 

Hands-On Exercise 7.4.1 Find a counterexample to illustrate why the “divides” relation, 
denoted | , over Z* is not antisymmetric. Is the “divides” relation reflexive over Z? How about 
transitivity? 


A 


Hands-On Exercise 7.4.2 Define the relation C on p({a, b, c, d}) according to 

scr o scrujs}. 

Is (p({a,b,c,d},Q) a poset? Which properties it does not possess? Explain. 


A 
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Obviously, if a A b but a ^ b, then we can write a -< b. We sometimes say a precedes 6, or 
b succeeds a. We also say a is the predecessor of b, or b is the successor of a. 

The digraph for a poset can be simplified. Since a is always related to a itself, it is redundant 
to draw a loop around every vertex. Since a A b and b A c always imply that a A c, there is 
no need to include the arc (directed edge) from a to c. So we follow the convention that we 
only draw an arc from a to b if a A b and there does not exist another element t such that 
a A t and t A b. Lastly, if a A b. we can place b above a so that all the arcs are pointing 
upward. This suggests that we can use undirected lines to make the graph easier to read. All 
these modifications lead to a much simpler graphical representation called a Hasse diagram. 


Example 7.4.4 It is clear that ({1,2,3,4,6,12}, |) is a poset. Its Hasse diagram is displayed 
below. 


12 



4 6 



2 3 



1 


In this convention of using undirected line, the A relation (hence, the ordering of the elements) 
is read from the bottom up. A 


Hands-On Exercise 7.4.3 Draw the Hasse diagram for the poset ({1, 2, 3,4, 6, 9, 12, 18, 36}, |). 


A 


The definition of a poset does not require every pair of distinct elements to be comparable. 
This means there may exist a ^ b such that a ^ b and b a. An example can be found in the 
numbers 2 and 3 in Example 7.4.4. If a partial ordering has the additional property that for 
any two distinct elements a and b, either a A b or b A a (hence, any pair of distinct elements 
are comparable), we call the relation a total ordering. 


Example 7.4.5 The poset (N, <) is a totally ordered set. The poset ({1, 5, 25, 125}, |) is also 
a totally ordered set. Its Hasse diagram is shown below. 
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125 


25 


5 


1 

It is clear that the Hasse diagram of any totally ordered set will look like the one displayed 
above. Consequently, a total ordering is also called a linear ordering. A totally ordered set is 
also called a chain. A 

Hands-On Exercise 7.4.4 Construct the Hasse diagram for the poset ({1, 2, 4, 18, 16}, |). Is 
it a totally ordered set? 


A 


Hands-On Exercise 7.4.5 Construct the Hasse diagram for the poset (p({a, 6, c}), C). 


A 


Summary and Review 

• A relation that is reflexive, antisymmetric, and transitive is called a partial ordering. 

• A set with a partial ordering is called a partially ordered set or a poset. 

• A poset with every pair of distinct elements comparable is called a totally ordered set. 

• A total ordering is also called a linear ordering, and a totally ordered set is also called a 
chain. 

Exercises 7.4 

1. Let A be the set of natural numbers that are divisors of 30. Construct the Hasse diagram 
of (A, |). 
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2. Let S = {{a}, {&}, {c}, {a, b}, {a, c}, {b, c}}. Construct the Hasse diagram for (S, C). 

3. Let ( A , X) be a poset, and B a nonempty subset of A. Show that (B, A) is also a poset. 
Naturally, we call ( B , A) a subposet of ( A , A). 

4. Dehne the relation A on Z according to 

a A b <t=> a = b or a mod 3 < b mod 3. 

(a) Show that (Z, A) is a poset. 

(b) Let B = {— 3, — 2, — 1, 0, 1, 2, — 3}. Construct the Hasse diagram for the subposet 
(■ B,A ). 

5. Dehne the relation A on Z according to 

a A b -£=> a = b or |a| < |6|. 

(a) Show that (Z, A) is a poset. 

(b) Construct the Hasse diagram for the subposet ( B , ^), where B = {—2, — 1, 0, 1, 2}. 

6. Dehne the relation A on Z x Z according to 

(a, 6) ^ (c, d) (a, 6) = (c, d) or a 2 + b 2 < c 2 + d 2 . 

(a) Show that (Z x Z, is a poset. 

(b) Construct the Hasse diagram for the subposet ( B , ^), where B = {0, 1, 2} x {0, 1, 2}. 

7. Construct an example of a subset B of p({a, 6, c, d}) such that (B, C) is a totally ordered 
set. 

8. Let 

A = {(to, n) | m, n G N and gcd(TO, n) = 1}, 
and dehne the relation A on A according to 

(a, b) A (c, d) ad < be. 

Prove that (A, A) is a totally ordered set. 
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Combinatorics 


8.1 What is Combinatorics? 

Combinatorics studies the arrangements of objects according to some rules. The questions that 
can be asked include 

• Existence. Do the arrangements exist? 

• Classification. If the arrangements exist, how can we characterize and classify them? 

• Enumeration. How many arrangements are there? 

• Construction. Is there an algorithm for constructing all the arrangements? 

Example 8.1.1 In how many ways can five people be seated at a round table? What if a 
certain pair of them refuses to sit next to one another? What if there are n people? A 

Example 8.1.2 Given integers n\ > 712 > • • • > nt > 1, a Young tableau of the shape 
(ni, 7x2, • ■ • , n t ) consists of t rows of left-justified cells, with rij cells in the ?'th row (counting from 
the top row). These cells are occupied by the integers 1 through n, where n = n± +712 + • • • + nt, 
such that the entries are in descending order across each row from left to right, and down each 
column from top to bottom. For instance, the three Young tableaux of the shape (3,1) are 
depicted in Figure 8.1. 


4 

3 2 

4 

3 1 

4 

2 1 

1 


2 


3 



Figure 8.1: The three Young tableaux of the shape (3, 1). 

It is known that there are 35 Young tableaux of the shape (4,2, 1). Can you list all of them? 
In general, one may ask, how many Young tableaux are there of shape (ni, ri2 , . . . , nt), and how 
can we generate all of them? A 

Example 8.1.3 A binary string is a sequence of digits, each of which being 0 or 1. Let a n 
be the number of binary strings of length n that do not contain consecutive Is. It is easy to 
check that cq = 2, <22 = 3, and <23 = 5. What is the general formula for a n ? A 

Example 8.1.4 The complexity of an algorithm tells us how many operations it requires. By 
comparing the complexity of several algorithms for solving the same problem, we can determine 
which one is most efficient. Let b n be the number of operations required to solve a problem of 
size n. If it is known that 

b n = 2 b n —\ -\- 3h„_2, n ^ 3, 

where bi = 1 and 62 = 3, what is the general formula for b n ? A 
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8.2 Addition and Multiplication Principles 

Recall that the cardinality of a finite set A , denoted |A|, is the number of elements it contains. 

Example 8.2.1 If A — {—1,0,2}, then \A\ = 3. Also, 

|{ 2 }| = 1 , 

|{2, 5, —1, — 3} | = 4, 

|{x € R. | x 2 = 1} | = 2. 

Notice that |0| = 0, because an empty set does not contain any element. A 

It becomes more interesting when we consider the cardinality of a union or an intersection 


of two or more sets. 

Example 8.2.2 Determine | A U B\ and \A n B\ if A = {2, 5} and B = {7, 9, 10}. 

Solution: Since A U B = {2,5,7,9,10}, and A n B = 0, it is clear that \A U B\ = 5, and 

|Ans| = o. a 

Example 8.2.3 Determine | A U B\ and \A n B\ if A = {2, 5} and B = {5, 9, 10}. 

Solution: Since A U B = {2,5,9, 10}, and A (~l B = {5}, it is clear that \A U B\ = 4, and 

\AnB\ = i. a 


Hands-On Exercise 8.2.1 Let A = {n £ Z \ —5 < n < 3}, and B = {n £ Z | — 3 < n < 5}. 
Evaluate |Anl?| and |AuJ3|. 


Use the addition 
principle only when 
the sets do not 
overlap. 


Watch for double 
counting! 


A 

The difference between the last two examples is whether the two sets A and B have a 
nonempty intersection. Two sets A and B are disjoint if A D B = 0. A collection of 
sets A\, A 2 , . . . , A n is said to be pairwise disjoint if A,; n Aj =0 whenever i j. When 
Ai, A 2 , . . . , A n are pairwise disjoint, their union is called a disjoint union. 

Example 8.2.4 Let A = {1, 0, -1}, B = {-2, 0, 2}, C = {-2, 2} and D = {3, 4, 5}. Then A, 
C, and D are pairwise disjoint, so are B and D , but A, B, and C are not. A 

Theorem 8.2.1 (Addition Principle) If the finite sets A 1; A 2 , . . . , A n are pairwise disjoint, 
then 

\Ai U A 2 U • • • U A n | = | Ai| + IA 2 I + • • • + \A n \. 

Use the addition principle if we can break down the problems into cases , and count how many 
items or choices we have in each case. The total number is the sum of these individual counts. 
The idea is, instead of counting a large set, we divide it up into several smaller subsets, and 
count the size of each of them. The cardinality of the original set is the sum of the cardinalities 
of the smaller subsets. This divide-and-conquer approach works perfectly only when the sets 
are pairwise disjoint. 

Example 8.2.5 To find the number of students present at a lecture, the teacher counts how 
many students there are in each row, then adds up the numbers to obtain the total count. A 

When the sets are not disjoint, the addition principle does not give us the right answer 
because the elements belonging to the intersection are counted more than once. We have to 
compensate the over-counting by subtracting the number of times these elements are over- 
counted. The simplest case covers two sets. 
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Theorem 8.2.2 (Principle of Inclusion-Exclusion (PIE)) For any finite sets A and B , 
we have 


\A U B\ = \A\ + \B\ - \AnB\. 


In general, use PIE to 
find the cardinality of 
a union of sets. 


Proof: Observe that A U B is the disjoint union of three sets 

A U B = (A - B) U (A n B) U {B - A). 

It is clear that \A — B\ = |A| — \A fl B |, and \B — A\ = \B\ — \A n B\. Therefore, 

\AUB\ = \A- B\ + \AnB\ + \B - A\ 

= (\A\-\AnB\) + \AnB\ + {\B\-\AnB\) 

= \A\ + \B\ - \ArB\, 

which is what we have to prove. ■ 


The principle of inclusion-exclusion also works if A and B are disjoint, because in such an 
event, |Afl B\ = 0, reducing PIE to the addition principle. 

Example 8.2.6 Assume the current enrollment at a college is 4689, with 60 students taking 
MATH 210, 42 taking CSIT 260, and 24 taking both. Together, how many different students 
are taking these two courses? In other words, determine the number of students who are taking 
either MATH 210 or CSIT 260. 


Solution: Let A be the set of students taking MATH 210, and B the set of students taking 
CSIT 260, Then, \A\ = 60, \B\ = 42, and \A fl B\ = 24. We want to find \A U B\. According to 
PIE, 

\AUB\ = \A\ + \B\ - \A n B\ = 60 + 42 - 24 = 78. 

Therefore, 78 students are taking either MATH 210 or CSIT 260. A 

Example 8.2.7 Among 4689 students, 2112 of them have earned at least 60 credit hours and 
2678 of them have earned at most 60 credit hours. How many students are there who have 
accumulated exactly 60 hours? 


Solution: Let A be the set of students who have earned at least 60 credit hours, and B be the 
set of students who have earned at most 60 credit hours. We want to find |An B\. According 
to PIE, 

4689 = \ A U B\ = \A\ + \B\ -\AnB\= 2112 + 2678 -\AnB\. 


Hence, 


\AnB\ = (2112 + 2678) - 4689 = 101. 


There are 101 students who have accumulated exactly 60 credit hours. 


A 


Hands-On Exercise 8.2.2 The attendance at two consecutive college football games was 
72397 and 69211 respectively. If 45713 people attended both games, how many different people 
have watched the games? 


A 
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Hands-On Exercise 8.2.3 The attendance at two consecutive college football games was 
72397 and 69211 respectively. If 93478 different individuals attended these two games, how 
many have gone to both? 


A 


Using complement 
may simplify a 
counting problem. 


Sometimes, it is easy to work with the complement of a set. 
Lemma 8.2.3 For any finite set S , we have 

\s\ = \u\-\s\, 

where U is the universal set containing S. 


Example 8.2.8 In Example 8.2.6, since there are 78 students taking either MATH 210 or 
CSIT 260, the number of students taking neither is 4689 — 78 = 4611. ▲ 

The principle of inclusion-exclusion can be extended to any number of sets. The situation is 
more complicated, because some elements may be double-counted, some triple-counted, etc. To 
give you a taste of the general result, here is the principle of inclusion-exclusion for three sets. 

Theorem 8.2.4 For any three finite sets A, B and C , 

\AUBUB\ = \A\ + \B\ + \C\ — \AnB\ — \Ar\C\ - \BnC\ + \AnBnC\. 

Proof: The union A\J B U C is the disjoint union of seven subsets: 

A-(BUC), B-(CUd), C-(A\JB), {A n B) - (A n B n C), 
(BnC)-(AnBnC), (end) - (inBn C), and An bug. 

We can apply an argument similar to the one used in the union of two sets to complete the 
proof. We leave the details as an exercise. ■ 

Hands-On Exercise 8.2.4 A group of students claims that each of them had seen at least one 
part of the Back to the Future trilogy. A quick show of hands reveals that 

• 47 had watched Part I; 

• 43 had watched Part II; 

• 32 had watched Part III; 

• 33 had watched both Parts I and II; 

• 27 had watched both Parts I and III; 

• 25 had watched both Parts II and III; 

• 22 had watched all three parts. 

How many students are there in the group? 


A 


Another useful counting technique is the multiplication principle. 
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Theorem 8.2.5 (Multiplication Principle) For any finite sets A and B, we have 

\AxB\ = \A\-\B\. 

Clearly, this can be extended to an ?r-fold Cartesian product. 

Corollary 8.2.6 For any finite sets Ai, A 2 , ■ ■ . , A n , we have 

|Ai x A 2 x • • • x A n \ = |Ai| • \A 2 \ \A n \. 

In many applications, it may be helpful to use an equivalent form. 

Theorem 8.2.7 (Multiplication Principle: Alternate Form) Ifataskconsistsofkst.eps, 
and if there are n* ways to finish step i, then the entire job can be completed in n\n 2 . . . Uk 
different ways. 


Now that we have two counting techniques, the addition principle and the multiplication 
principle, which one should we use? The major difference between them is whether 

• the jobs can be divided into cases, groups, or categories ; or 

• each job can be broken up into steps. 

In practice, it helps to draw a picture of the configurations that we are counting. 


Which one should we 
use: the addition or 
multiplication 
principle? Look for 
keywords such as 
cases or groups versus 
steps or positions. 


Example 8.2.9 How many different license plates are there if a standard license plate consists 
of three letters followed by three digits? 


Solution: We need to decide how many choices we have in each position. Draw a picture to 
show the configuration. Draw six lines to represent the six positions. Above each line, describe 
briefly the possible candidates for that position, and under each line, write the the number of 
choices. 


choices: 

any 

letter 

any 

letter 

any 

letter 

any 

digit 

any 

digit 

any 

digit 

# of choices: 

26 

26 

26 

10 

10 

10 


This left-to-right configuration suggests that the multiplication principle should be used. The 
answer is 26 • 26 • 26 • 10 • 10 • 10 = 260 3 . 

As you become more experienced, you can argue directly, as follows. There are 26 choices for 
each of the three letters, and 10 choices for each digit. So there are 26 • 26 • 26 • 10 • 10 • 10 = 260 3 
different license plates. A 

Example 8.2.10 Find the number of positive integers not exceeding 999 that end with 7. 
Solution 1: The integers can have one, two, or three digits, so we have to analyze three cases. 

• Case 1. There is only one integer with one digit, namely, the integer 7. 

• Case 2. If there are two digits, the first could be any digit between 1 and 9, and the last 
digit must be 7. 


choices: 1-9 7 


# of choices: 9 


1 


This gives us nine choices. 
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• Case 3. If there are three digits, the first digit could be any digit between 1 and 9, the 
second any digit between 0 and 9, and the last digit must be 7. 


choices: 

1-9 

any 

digit 

7 

# of choices: 

9 

10 

1 


Hence, there are 90 integers in this case. 

Combining the three cases, we have a total of 1+9+90 = 100 integers that meet the requirements. 

Solution 2: The integers could be written as three-digit integers if we allow 0 as the leading 
digits. For instance, 7 can be written as 007, and 34 as 034. Under this agreement, we have to 
fill three positions where the last one is always occupied by the digit 7. The first two digits are 
0, 1, 2, . . . , 8, or 9, so there are 10 choices for each position. 



any 

any 


choices: 

digit 

digit 

7 

# of choices: 

10 

10 

1 


Together, there are 10 • 10 = 100 such integers. A 

Hands-On Exercise 8.2.5 How many natural numbers less than 1000000 are there that end 
with the digit 3? 


A 

Hands-On Exercise 8.2.6 How many natural numbers less than 10000 are there that end 
with the digit 0? 


A 

Example 8.2.11 Determine the number of four-digit positive integers without repeated digits. 

Solution: We want to determine how many choices there are for each place value. The first 
digit has nine choices because it cannot be 0. Once the first digit is chosen, there are nine choices 
left for the second digit; and then eight choices for the next digit, and seven choices for the last 
digit. Together, we have 9 • 9 • 8 • 7 = 4536 four-digit positive integers that do not contain any 
repeated digits. Question: Can we start counting from the last digit? A 

Hands-On Exercise 8.2.7 How many six-digit natural numbers are there that do not have 
any repeated digit? 


A 

Example 8.2.12 Determine |p(£)|, where S is an n-element set. 
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Solution: We want to determine the number of ways to form a subset. Let the n elements be 
si, S 2 , ■ ■ ■ , s n . To form a subset, we go through each element s-i and decide whether it should be 
included in the subset, thus there are two choices for each element. 

element: si S 2 s n 

choices: Y/N Y/N . . . Y/N 

# of choices: 2 2 2 

We have 2 • 2 • • • • -2 = 2 n ways to form the subsets. Thus, |p(5)| =2". A 

n factors 


Example 8.2.13 How many two-digit positive integers do not have consecutive 5s? 

Solution 1: There are three disjoint cases: 

(i) both digits are not 5, 

(ii) only the first digit is 5, and 

(iii) only the last digit is 5. 

There are 8-9 + 9 + 8 = 89 integers that meet the requirement. 

Solution 2: An easier solution is to consider the complement of the problem. There is only 
one integer with consecutive 5s, namely, the integer 55. There are 90 two-digit integers, hence 
90 — 1 = 89 of them do not have consecutive 5s. A 

Hands-On Exercise 8.2.8 How many three-digit natural numbers are there that do not have 
consecutive 4s? 


A 

Example 8.2.14 In how many ways can we draw a sequence of three cards from a standard 
deck of 52 cards? 

Solution: This is a trick question! The answer depends on whether we can return a drawn card 
to the deck. With replacement, the answer is 52 3 ; without replacement, it is 52 • 51 • 50. A 

Example 8.2.15 A standard New York State license plate consists of three letters followed 
by four digits. Determine the number of standard New York State license plates with K as the 
first letter or 8 as the first digit. 

Solution: The keyword “or” suggests that we are looking at a union, hence, we have to apply 
PIE. We need to analyze three possibilities: 

• There are 26 2 • 10 4 license plates with K as the first letter. 

• There are 26 3 • 10 3 license plates with 8 as the first digit. 

• There are 26 2 • 10 3 license plates with K as the first letter and 8 as the first digit. 

The answer is 26 2 • 10 4 + 26 3 • 10 3 — 26 2 • 10 3 . A 

Hands-On Exercise 8.2.9 To access personal account information, a customer could log in 
to the bank’s web site with a PIN consisting of two letters followed by 


It’s easier to use 
complement! 


With or without 
replacements (or 
repetitions) makes 
big difference. 
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(a) exactly four digits, 

(b) at most six digits, 

(c) at least two but at most 6 digits. 

How many different PINs are there in each case? 


A 


Summary and Review 

• Use the addition principle if the problem can be divided into cases. Make sure the cases 
do not overlap. 

• If the cases overlap, the number of objects belonging to the overlapping cases must be 
subtracted from the total to obtain the correct count. 

• In particular, the principle of inclusion-exclusion states that |HUH| = \A\ + \B\ — \ AtlB\. 

• Use the multiplication principle if the problem can be solved in several steps. 

• How can we get started? Imagine you want to list all the possibilities, what is a systematic 
way of doing so? Follow the steps, and count how many objects you would end up with. 

• It may be helpful to use a schematic diagram. Draw one line for each step. Above the lines, 
write the choices. Below the lines, write the number of choices. Apply the multiplication 
principle to finish the problem. 

• If there are other cases involved, repeat, and add the results from all the possible cases. 

Exercises 8.2 

1. A professor surveyed the 98 students in her class to count how many of them had watched 
at least one of the three films in The Lord of the Rings trilogy. This is what she found: 

• 74 had watched Part I; 

• 57 had watched Part II; 

• 66 had watched Part III; 

• 52 had watched both Parts I and II; 

• 51 had watched both Parts I and III; 

• 45 had watched both Parts II and III; 

• 43 had watched all three parts. 

How many students did not watch any one of these three movies? 

2. Forty-six students in a film class told the professor that they had watched at least one of 
the three films in The Godfather trilogy. Further inquiry led to the following data: 

• 41 had watched Part I; 

• 37 had watched Part II; 

• 33 had watched Part III; 

• 33 had watched both Parts I and II; 

• 30 had watched both Parts I and III; 

• 29 had watched both Parts II and III. 

(a) How many students had watched all three films? 

(b) How many students had watched only Part I? 
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(c) How many students had watched only Part II? 

(d) How many students had watched only Part III? 

3. Joe has 10 dress shirts and seven bow ties. In how many ways can he match the shirts 
with bow ties? 

4. A social security number is a sequence of nine digits. Determine the number of social 
security numbers that satisfy the following conditions: 

(a) There are no restrictions. 

(b) The digit 8 is never used. 

(c) The sequence does not begin or end with 8. 

(d) No digit is used more than once. 

5. A professor has seven books on discrete mathematics, five on number theory, and four on 
abstract algebra. In how many ways can a student borrow two books not both on the 
same subject? 

Hint : Which two subjects would the student choose? 

6. How many different collections of cans can be formed from five identical Cola-Cola cans, 
four identical Seven-Up cans, and seven identical Mountain Dew cans? 

Hint: How many cans of Cola-Cola, Seven-Up, and Mountain Dew would you pick? 

7. How many five-letter words (technically, we should call them strings, because we do not 
care if they make sense) can be formed using the letters A, B, C, and D, with repetitions 
allowed. How many of them do not contain the substring BAD? 

Hint: For the second question, consider using a complement. 

8. How many different five-digit integers can be formed using the digits 1, 3, 3, 3, 5? 

Hint: The three digits 3 are identical, so we cannot tell the difference between them. 
Consequently, what really matters is where we put the digits 1 and 5. Once we place the 
digits 1 and 5, the remaining three positions must be occupied by the digits 3. 

9. Four cards are chosen at random from a standard deck of 52 playing cards, with replace- 
ment allowed. This means after choosing a card, the card is return to the deck, and the 
deck is reshuffled before another card is selected at random. Determine the number of 
such four-card sequences if 

(a) There are no restrictions. 

(b) None of the cards can be spades. 

(c) All four cards are from the same suit. 

(d) The first card is an ace and the second card is not a king. 

(e) At least one of the four cards is an ace. 

10. Three different mathematics final examinations and two different computer science final 
examinations are to be scheduled during a five-day period. Determine the number of ways 
to schedule these final examinations from 11 AM to 1 PM if 

(a) There are no restrictions. 

(b) No two examinations can be scheduled on the same day. 

(c) No two examinations from the same department can be scheduled on the same day. 

(d) Each mathematics examination must be the only examination for the day on which 
it is scheduled. 

11. Determine the number of four-digit positive integers that satisfy the following conditions: 
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The keyword 
arrangement; 
no repetition 
allowed. 


(a) There are no restrictions. 

(b) No integer contains the digit 8. 

(c) Every integer contains the digit 8 at least once. 

(d) Every integer is a palindrome (A positive integer is a palindrome if it remains the 
same when read backward, for example, 3773 and 47874). 

12. A box contains 12 distinct colored balls (for instance, we could label them as 1, 2, . . . , 12 
to distinguish them). Three of them are red, four are yellow, and five are green. Three 
balls are selected at random from the box, with replacement. Determine the number of 
sequences that satisfy the following conditions: 

(a) There are no restrictions. 

(b) The first ball is red, the second is yellow, and the third is green. 

(c) The first ball is red, and the second and third balls are green. 

(d) Exactly two balls are yellow. 

(e) All three balls are green. 

(f) All three balls are the same color. 

(g) At least one of the three balls is red. 

13. Let A = {a, b, c, d, e, /} and B = {1, 2, 3, 4, 5, 6, 7, 8}. Determine the number of functions 

that satisfy the following conditions: 

(a) There are no restrictions. 

(b) / is one-to-one. 

(c) / is onto. 

(d) f(x) is odd for at least one x in A. 

(e) /(a) = 3 or f(b) is odd. 

(f) r i ( 4 ) = {fl}. 

14. How many onto functions are there from an n-element set A to {a, &}? 

8.3 Permutations 

Let A be a finite set with n elements. For 1 < r < n, an r -permutation of A is an ordered 
selection of r distinct elements from A. In other words, it is the linear arrangement of r distinct 
objects aia 2 . . . a r , where a,; G A for each i. The number of r-permutations of an n-element set 
is denoted by P(n, r). It also appears in many other forms and names. 

• The number of permutations of n objects, taken r at a time without replacement. 

• The number of ways to arrange n objects (in a sequence), taken r at a time without 
replacement. 

All of them refer to the same number P(n,r). The keywords are: 

is (i) “ Permutation ” or “ arrangement ,” both of which suggest that order does matter. 

be sure (ii) “ Without replacement ” means the entries in the permutation/arrangement are distinct. 

is 

In some textbooks, the notation P(n,r) is also written as P” or n P r . 

Example 8.3.1 The 1-permutations of {a, b, c, d} are 

a, b, c, d. 

Consequently, P(4, 1) = 4. The 2-permutations of {a, b , c, d} are 


ab , 

ac, 

ad, 

ba , 

be, 

bd , 

ca, 

cb, 

cd , 

da , 

db, 

dc. 
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Hence, P(4, 2) = 12. What are the 3-permutations and 4-permutations of {a, b, c, d}? Can you 
explain why the numbers of 3-permutations and 4-permutations are equal? A 

Computing the value of P(n, r ) is easy. We want to arrange r objects in a sequence. These r 
objects are to be selected from a pool of n items. Hence there are n ways to fill the first position. 
Once we settle with the first position, whatever we put there cannot be used again. We are left 
with n—1 choices for the second position. Likewise, once it is filled, there are only n — 2 choices 
for the third position. Now it is clear that P(n,r) is the product of r numbers of the form n, 
n — 1, n — 2, ... . What is the last number in this list? There are r — 1 numbers before it, so it 
must be 7i — (r — 1) = 7i — r + 1. 

Theorem 8.3.1 For all integers n and r satisfying 1 < r < n, 

Tl\ 

Pin , r) = n(n — 1) • • • (n — r + 1) = 

(n — r)\ 

Although the formula P(n, r) = is rather easy to remember, the other form 

P(?i, r) = n(n — 1) • • • (n — r + 1) 

v v 

r 

is actually more useful in numeric computation, especially when it is done by hand. We multiply 
n by the next smaller integer n—1, and then the next smaller integer n — 2, and so forth, until 
we have a product of r consecutive factors. For instance, 

P(4,2) =4-3 = 12, and P(9, 3) = 9 • 8 • 7 = 504. 

How about P(n, 1) and P(n, 2)? 

Example 8.3.2 How would you compute the value of P( 278, 3) by hand, or if your calculator 
does not have that n P r button? 

Solution: We find P( 278, 3) = 278 • 277 • 276 = 21253656. A 

Hands-On Exercise 8.3.1 Compute P(21,4) by hand. 


A 

Remark. It follows from the first version of the formula that P(n, n) = n\. The second version 
reduces to 

n! 

n! = P(n, n) = — . 

Consequently, to make the second version works, we have to define 0! = 1. <0> 

Remark. In your homework assignments, quizzes, tests, and final exam, it is perfectly fine to 
use the notation P(n,r) in your answers. In fact, leaving the answers in terms of P(n,r) gives 
others a clue to how you obtained the answer. <0> 

It is often easier and less confusing if we use the multiplication principle. Once you realize the 
answer involves P(n, r), it is not difficult to figure out the values of n and r. A good start, before 
jumping into any calculation , is to ask yourself, how would you list the possible arrangements? 
Also, try constructing some examples. These can give you an idea of how many choices you 
have in each position. 


The formula for 
P{n, r) looks like a 
factorial; except that 
it stops at n — r + 1 
instead of 1. Just 
remember, it contains 
exactly r factors. 


You may leave the 
answers in terms of 
P(n, r). 
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Example 8.3.3 A police station has 12 police officers on duty. In how many ways can they be 
assigned to foot patrol in five different districts, assuming that we assign only one police officer 
per district. 

Solution: Imagine you are the officer who schedules the assignments. You have to assign 
someone to the first district, and then another officer to the second district, and so forth. 


district: 

first 

second 

third 

another 

fourth 


any 

another 

different 


choices: 

officer 

officer 

officer 


# of choices: 

12 

11 

10 

9 


There are 12 choices for the first district, 11 for the second, etc. The multiplication principle 
implies that the answer is 12 • 11 • • •, which is in the form of P(n,r). Since the product starts 
with 12, and we need a product of 5 consecutive numbers, the answer is -P(12, 5). A 

Hands-On Exercise 8.3.2 A school sends a team of six runners to a relay game. In how many 
ways can they be selected to participate in the 4 x 100 nr relay? 


A 

From a collection of 10 flags of different patterns, how many three-flag signals 
pole? 

Solution: Since the flags are arranged on a flag pole, the order is important. There are 10 
choices for the top flag, 9 for the second, and 8 for the third. Therefore, 10 • 9 • 8 = P(10, 3) 
different signals can be formed. A 

Example 8.3.5 Determine the number of functions /: {1, 3, 4, 7, 9} — ► Z 22 if 

(a) There are no restrictions. 

(b) / is one-to-one. 

(c) / is onto. 

When you get stuck, 
take a step back and 
look at a generalized 
problem. 


images: /( 1) /( 3) /( 4) /( 7) /( 9) 

choices: 


# of choices: 

(a) If there are no restrictions, we have 22 choices for each of these five images. Hence there are 
22 • 22 • 22 • 22 • 22 = 22 5 functions. 

(b) If / is one-to-one, we cannot duplicate the images. So we have 22 choices for /( 1), 21 for 
/( 3), and so on. There are P( 22,5) one-to-one functions. 


Solution: To distinguish one function from another function, we have to compare their images. 
Hence, a function is completely determined by its images (surprise: not by its formula!). After 
all, we may not even know the formula behind a function, so we cannot and should not rely on 
the formula alone. 

To determine how many functions there are from {1, 3, 4, 7, 9} to Z 22 , we have to determine 
the number of ways to assign values to /( 1), /( 3), /(4), /( 7) and /( 9). 


Example 8.3.4 

can we put on a 
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(c) There are at most five distinct images, but Z22 has 22 elements, so at least 17 of them will 
be left unused. Hence / can never be onto. The number of onto functions is therefore zero. A 

Hands-On Exercise 8.3.3 How many functions are there from {2,4,6,8,10} to Z15? How 
many of them are one-to-one? 


A 

Example 8.3.6 Let A and B be finite sets, with |H| = s and \B\ = t. Determine the number 
of one-to-one functions from A to B. 


Solution: How can we come up with a one-to-one function from A to B? We have to specify 
the image of each element in A. There are t choices for the first element. Since repeated images 
are not allowed, we have only t — 1 choices for the image of the second element in A, and t — 2 
choices for the third image, and so forth. The answer is P(t, s). 

What if t < s? We know that in such an event, there does not exist any one-to-one function 
from A to B because there are not enough distinct images. Does P(t, s) still make sense? The 
product version of the formula says that P(t, s ) is a product of s consecutive numbers. Hence, 
for example, 

P( 3, 6) = 3 • 2 • 1 • 0 • (-1) • (-2) = 0, 

which means there is no one-to-one function from A to B. A 


P ( n , r ) = 0 if n < r . 


Not all problems use P(n, r ). In many situations, we have to use P(n , r) together with other 
numbers. The safest approach is to rely on the addition and multiplication principles. 

Example 8.3.7 How many four-digit integers are there that do not contain repeated digits? 

Solution: There are 10 choices for each digit, but the answer is not P(10, 4), because we cannot 
use 0 as the first digit. To ensure that we have a four-digit integer, the first digit must be nonzero. 
This leaves us 9 choice for the first digit. Then we have 9 choices for the second digit, 8 and 7 
for the next two. The answer is 9 • 9 • 8 • 7. A 

Example 8.3.8 Twelve children are playing “musical chairs,” with 9 chairs arranged in a 
circle on the floor. In how many ways can they be seated? 

Solution: The answer is not P(12, 9) because any position can be the first position in a circular 
permutation. What matters is the relative placement of the selected objects, all we care is 
who is sitting next to whom. The correct answer can be found in the next theorem. A 

Theorem 8.3.2 The number of circular r -permutations of an n-element set is P(n,r)/r. 

Proof: Compare the number of circular r-permutations to the number of linear r-permutations. 
Start at any position in a circular r-permutation, and go in the clockwise direction; we obtain 
a linear r-permutation. Since we can start at any one of the r positions, each circular r- 
permutation produces r linear r-permutations. This means that there are r times as many 
circular r-permutations as there are linear r-permutations. Therefore, the number of circular 
r-permutations is P(n, r)/r. I 

Alternate Proof. Let A be the set of all linear r-permutations of the n objects, and let B be 
the set of all circular r-permutations. Define a function from A to B as follows. Given any 
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r-permutation, form its image by joining its “head” to its ’’tail.” It becomes clear, using the 
same argument in the proof above, that / is an r-to-one function, which means / maps r distinct 
elements from A to the same image in B. Therefore A has r times as many elements as in B. 
This means \A\ = r • \B\. Since |A| =P(n,r), we find \B\ = P(n,r)/r. ■ 

Hands-On Exercise 8.3.4 A circular cardboard has eight dots marked along its rim. In how 
many ways can we glue eight beads of different colors, one on each dot? 


A 

Hands-On Exercise 8.3.5 In how many ways can we form a necklace with eight beads of 
different color? 

Remark: When a necklace is flipped around, it is still the same necklace. Thus, the orientation 
of the necklace does not matter: we can count the beads clockwise, or counterclockwise. 


A 

Example 8.3.9 In how many ways can we arrange 20 knights at a round table? What if two 
of them refuse to sit next to each other? 

Solution: Without any restriction, there are 201/20 = 19! ways to seat the 20 knights. To 
solve the second problem, use complement. If two of them always sit together, we in effect are 
arranging 19 objects in a circle. Among themselves, these two knights can be seated in two 
ways, depending on who is sitting on the left. Hence, there are 2 • 191/19 = 2-18! ways to seat 
the 20 knights, with two of them always together. Therefore, the final answer to the second 
problem is 19! — 2 • 18!. A 

Summary and Review 

• Use permutation if order matters: the keywords arrangement, sequence, and order suggest 
that we should use permutation. 

• It is often more effective to use the multiplication principle directly. 

• The number of ways to arrange n objects linearly is n!, and the number of ways to arrange 
them in a circle is (n — 1)!. 


Exercises 8.3 

1. How many eight-character passwords can be formed with the 26 letters in the English 
alphabet, each of which can be in uppercase or lowercase, and the 10 digits? How many 
of them do not have repeated character? 

2. How many functions are there from Xq to Z 12 ? How many of them are one-to-one? 

3. The school board of a school district has 14 members. In how many ways can the chair, 
first vice-chair, second vice-chair, treasurer, and secretary be selected? 

4. The wrestling teams of two schools have eight and 10 members respectively. In how many 
ways can three matches be made up between them? 
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5. The wrestling teams of three schools have seven, 10, and 11 members, respectively. Each 
school will have three matches against each of the other two school. In how many ways 
can these matches be arranged? 

6. A teacher takes her AP calculus class of 8 students to lunch. They sit around a circular 
dining table. 

(a) How many seating arrangements are possible? 

(b) How many seating arrangements are there if the teacher has to sit on the chair closest 
to the soda fountain? 

(c) Among the students are one set of triplets. How many seating arrangements are there 
without all three of them sitting together? 

7. Eleven students go to lunch. There are two circular tables in the dining hall, one can seat 
7 people, the other can hold 4. In how many ways can they be seated? 

8. Five couples attend a wedding banquet. They are seated on a long table. How many 
seating arrangements that alternate men and women? What if the table is circular in 
shape? 

8.4 Combinations 

In many counting problems, the order of arrangement or selection does not matter. In essence, 
we are selecting or forming subsets. 

Example 8.4.1 Determine the number of ways to choose 4 values from 1, 2, 3, ..., 20, in 
whicntne order of selection does not matter. 

Solution: Let N be the number of ways to choose the 4 numbers. Since the order in which the 
numbers are selected does not matter, these are not sequences (in which order of appearance 
matters). We can change a selection of 4 numbers into a sequence. The 4 numbers can be 
arranged in P(4, 4) = 4! ways. Therefore, all these 4-number selections together produce N ■ 

4! sequences. The number of 4-number sequences is P(20,4). Thus, N • 4! = P(20,4), or 
equivalently, N = P(20,4)/4!. A 

Definition. The number of r-element subsets in an n-element set is denoted by 

C(n, r ) or ^ , 

Do not write (") as 
(y) orC("). Instead, 
write it as (") or 
C(n,r). 

An r-combination is 
an r-element subset. 


where (”) is read as “n choose r.” It determines the number of combinations of n objects, 
taken r at a time (without replacement). Alternate notations such as n C r and C T r l can be found 
in other textbooks. Do not write it as (-); this notation has a completely different meaning. <C> 

Recall that (") counts the number of ways to choose or select r objects from a pool of n 
objects in which the order of selection does not matter. Hence, r-combinations are subsets of 
size r. 


Example 8.4.2 The 2-combinations of S = {a,b,c,d} are 


{a, b}, {a, c}, {a, d}, {6, c}, {b, d}, and {c, d}. 

Therefore (2) = 6. What are the 1-combinations and 3-combinations of S? What can you say 
about the values of (f) and (3)? 
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Solution: The 1-combinations are the singleton sets {a}, {&}, {c}, and {d}. Hence, ( 4 ) = 4. 
The 3-combinations are 


{a, b, c}, {a, b, d}, {a,c, d}, and {b,c,d}. 


Thus, ( 3 ) =4. A 

Notice that both 
numerator and 
denominator have r 
factors. 

Proof: The idea is similar to the one we used in the alternate proof of Theorem 8.3.2. Let A be 
the set of all r-permutations, and let B be the set of all r-combinations. Define /: A — >■ B to be 
the function that converts a permutation into a combination by “unscrambling” its order. Then 
f is an r!-to-one function because there are r! ways to arrange (or shuffle) r objects. Therefore 

\A\=r\-\B\. 

Since \A\ = P(n,r), and \B\ = ("), it follows that (") = P(n,r)/r\. ■ 


Theorem 8.4.1 For all integers n and r satisfying 0 < r < n, we have 

/ n\ P{n, r ) n(n — 1) • • • {n — r + 1) n! 

\r ) r! r! r! (n — r)! 


Example 8.4.3 There are ( 4 5 °) ways to choose 5 numbers, without repetitions, from the inte- 
gers 1,2,..., 40. To compute its numeric value by hand, it is easier if we first cancel the common 
factors in the numerator and the denominator. We find 



40 • 39 • 38 • 37 • 36 
5 • 4 • 3 • 2 • 1 


13 • 38 • 37 • 36, 


which gives ( 4 °) = 658008. 


A 


Hands-On Exercise 8.4.1 Compute (g 2 ) by hand. 


A 

Hands-On Exercise 8.4.2 A three-member executive committee is to be selected from a group 
of seven candidates. In how many ways can the committee be formed? 


A 


Hands-On Exercise 8.4.3 How many subsets of {1,2,..., 23} have five elements? 


A 


Corollary 8.4.2 For 0 < r < n, we have 


n 


n 
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Proof: According to Theorem 8.4.1, we have 



n! 

(n — r) \ (n — (n — r))! 


n ! 

(n — r)! r! ’ 


which is precisely (™) . ■ 

Example 8.4.4 To compute the numeric value of ( 47 ), instead of computing the product of 
47 factors as indicated in the definition, it is much faster if we use 

/50\ _ /50\ _ 50 • 49 • 48 
V47y v 3 / 3-2-i ’ 

from which we obtain ( 47 ) = 19600. ▲ 

Hands-On Exercise 8.4.4 Compute, by hands, the numeric value of (j^g). 


A 

Now we are ready to look at some mixed examples. In all of these examples, sometimes we 
have to use permutation, other times we have to use combination. Very often we need to use 
both, together with the addition and multiplication principles. You may ask, how can I figure 
out what to do? We suggest asking yourself these questions: 

1. Use the construction approach. If you want to list all the configurations that meet the 
requirement, how are you going to do it systematically? 

2. Are there several cases involved in the problem? If yes, we need to list them first, before 
we go through each of them one at a time. Finally, add the results to come up with the 
final answer. 

3. Do we allow repetitions or replacements? This question can also take the form of whether 
the objects are distinguishable or indistinguishable. 

4. Does order matter? If yes, we have to use permutation. Otherwise, use combination. 

5. Sometimes, it may be easier to use the multiplication principle instead of permutation, 
because repetitions may be allowed (in which case, we cannot use permutation, although 
we can still use the multiplication principle). Try drawing a schematic diagram and decide 
what we need from it. If the analysis suggests a pattern that follows the one found in a 
permutation, you can then use the formula for permutation. 

6. Do not forget: it may be easier to work with the complement. 

It is often not clear how to get started because there seem to be several ways to start the 
construction. For example, how would you distribute soda cans among a group of students? 
There are two possible approaches: 

(i) From the perspective of the students. Imagine you are one of the students, which soda 
would you receive? 

(ii) From the perspective of the soda cans. Imagine you are holding a can of soda, to whom 
would you give this soda? 


What should we use: 
the addition or the 
multiplication 
principle? 
Permutation or 
combination? Here 
are some suggestions. 


More suggestions. 


Depending on the actual problem, usually only one of these two approaches would work. 
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Example 8.4.5 Suppose we have to distribute 10 different soda cans to 20 students. It is 
clear that some students may not get any soda. In fact, some lucky students could receive more 
than one soda (the problem does not say this cannot happen). Hence, it is easier to start from 
the perspective of the soda cans. 

We can give the first soda to any one of the 20 students, and we can also give the second 
soda to any one of the 20 students. In fact, we always have 20 choices for each soda. Since we 
have 10 sodas, there are 20 • 20 • • • 20 = 20 10 ways to distribute the sodas. A 

10 

Example 8.4.6 In how many ways can a team of three representatives be selected from a 
class of 885 students? In how many ways can a team of three representatives consisting of a 
chairperson, a vice-chairperson, and a secretary be selected? 

Solution: If we are only interested in selecting three representatives, order does not matter. 
Hence, the answer would be ( 885 ) . If we are concerned about which offices these three represen- 
tative will hold, then the answer should be P(885,3). A 

Hands-On Exercise 8.4.5 Mike needs some new shirts, but he has only enough money to 
purchase five of the eight that he likes. In how many ways can he purchase the five shirts by 
choosing them at random? 


A 

Hands-On Exercise 8.4.6 Mary wants to purchase four shirts for her four brothers, and she 
would like each of them to receive a different shirt. She finds ten shirts that she thinks they will 
like. In many ways can she select them? 


A brief tutorial for 
those who do not 
play cards. 


A 

Playing cards provide excellent examples for counting problems. Just in case you are not 
familiar with them, let us briefly review what a deck of playing cards contains. 

• There are 52 playing cards, each of them is marked with a suit and a rank. 

• There are four suits: spades (4), hearts (<?), diamonds (<» and clubs (#). 

• Each suit has 13 ranks, labeled A, 2, 3, . . . , 9, 10, J, Q, and K, where A means ace, J 
means jack, Q means queen, and K means king. 

• Each rank has 4 suits (see above). 


Example 8.4.7 Determine the number of five-card poker hands that can be dealt from a deck 
of 52 cards. 

Solution: All we care is which five cards can be found in a hand. This is a selection problem. 
The answer is ( 5 5 2 ) . A 


Hands-On Exercise 8.4.7 In how many ways can a 13-card bridge hand be dealt from a 
standard deck of 52 cards? 


A 
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Example 8.4.8 In how many ways can a deck of 52 cards be dealt in a game of bridge? (In 
a bridge game, there are four players designated as North, East, South and West, each of them 
is dealt a hand of 13 cards.) 

Solution: The difference between this problem and the last example is that the order of dis- 
tributing the four bridge hands makes a difference. This is a problem that combines permutations 
and combinations. As we had suggested earlier, the best approach is to start from scratch, us- 
ing the addition and/or multiplication principles, along with permutation and/or combination 
whenever it seems appropriate. 

There are (^) ways to give 13 cards to the first player. Now we are left with 39 cards, from 
which we select 13 to be given to the second player. Now, out of the remaining 26 cards, we have 
to give 13 to the third player. Finally, the last 13 cards will be given to the last player (there is 
only one way to do it). The number of ways to deal the cards in a bridge game is ('^) ( 8 g) (^). 

We could have said the answer is 



The last factor ( 4 g) is the number of ways to give the last 13 cards to the fourth player. Numer- 
ically, (Jg) = 1, so the two answers are the same. Do not dismiss this extra factor as redundant. 
Take note of the nice pattern in this answer. The bottom numbers are 13, because we are 
selecting 13 cards to be given to each player. The top numbers indicate how many cards are still 
available for distribution at each stage of the distribution. The reasoning behind the solution is 
self-explanatory! A 

Example 8.4.9 Determine the number of five-card poker hands that contain three queens. 
How many of them contain, in addition to the three queens, another pair of cards? 

Solution: (a) The first step is to choose the three queens in ( 4 ) ways, after which the remaining 
two cards can be selected in ( 48 ) ways. Therefore, there are altogether ( 4 ) ( 48 ) hands that meet 
the requirements. 

(b) As in part (a), the three queens can be selected in ( 4 ) ways. Next, we need to select the 
pair. We can select any card from the remaining 48 cards (therefore, there are 48 choices), after 
which we have to select one from the remaining 3 cards of the same rank. This gives 48 • 3 
choices for the pair, right? The answer is NO\ 

The first card we picked could be 9?8, and the second could be ^8. However, the first 
card could have been X 8, and the second ^8. These two selections are counted as different 
selections, but they are actually the same pair! The trouble is, we are considering “first,” and 
“second” cards, which in effect imposes an ordering among the two cards, thereby turning it 
into a sequence or an ordered selection. We have to divide the answer by 2 to overcome the 
double-counting. The answer is therefore 

Here is a better way to count the number of pairs. An important question to ask is 

Which one should we pick first: the suit or the rank? 

Here, we want to pick the rank first. There are 12 choices (the pair cannot be queens) for the 
rank, and among the four cards of that rank, we can pick the two cards in ( 4 ) ways. Therefore, 
the answer is 12 ( 2 ). Numerically, the two answers are identical, because 12(/) = 12 • ^ 

In summary: the final answer is ( 4 ) • 12 ( 2 ). A 

Hands-On Exercise 8.4.8 How many bridge hands contain exactly four spades? 


The main question is: 
choose the suit or the 
rank Erst? 


A 
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Hands-On Exercise 8.4.9 How many bridge hands contain exactly four spades and four 
hearts? 


A 

Hands-On Exercise 8.4.10 How many bridge hands are there containing exactly four spades, 
three hearts, three diamonds, and three clubs? 


A 

Example 8.4.10 How many positive integers not exceeding 99999 contain exactly three 7s? 

Solution: Regard each legitimate integer as a sequence of five digits, each of them selected 
from 0, 1, 2, . . . , 9. For example, the integer 358 can be considered as 00358. Three out of 
the five positions must be occupied by 7. There are ( 3 ) ways to select these three slots. The 
remaining two positions can be filled with any of the other nine digits. Hence, there are ( 3 ) • 9 2 
such integers. A 

Example 8.4.11 How many five-digit positive integers contain exactly three 7s? 

Solution: Unlike the last example, the first of the five digits cannot be 0. Yet, the answer is 
not ( 3 ) • 9 • 8. Yes, there are ( 3 ) choices for the placement of the three 7s, but some of these 
selections may have put the 7s in the last four positions. This leaves the first digit unfilled. The 
nine choices counted by 9 allows a zero to be placed in the first position. The result is, at best, 
a four-digit number. The correct approach is to consider two cases: 

• Case 1. If the first digit is not 7, then there are eight ways to fill this slot. Among the 
remaining four positions, three of them must be 7, and the last one can be any digit other 
than 7. So there are 8 • ( 3 ) • 9 integers in this category. 

• Case 2. If the first digit is 7, we still have to put the other two 7s in the other four 
positions. There are ( 2 ) • 9 2 such integers. 

Together, the two cases give a total of 8 • ( 3 ) • 9 + ( 2 ) • 9 2 = 774 integers. A 

Hands-On Exercise 8.4.11 Five balls are chosen from a bag of eight blue balls, six red balls, 
and five green balls. How many of these five-ball selections contain exactly two blue balls? 


A 

Example 8.4.12 Find the number of ways to select five balls from a bag of six red balls, eight 
blue balls and four yellow balls such that the five-ball selections contain exactly two red balls 
or two blue balls. 

Solution: The keyword “or” suggests this is a problem that involves the union of two sets, 
hence, we have to use PIE to solve the problem. 
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• How many selections contain two red balls? Following the same argument used in the last 
example, the answer is ( 2 )(g 2 ). 

• How many selections contain two blue balls? The answer is (*) (3°). 

• How many selections contain two red balls and 2 blue balls? The answer is (®) (®) (f) • 
According to PIE, the final answer is 



In each term, the upper numbers always add up to 18, and the sum of the lower numbers is 
always 5. Can you explain why? A 

Example 8.4.13 We have 11 balls, five of which are blue, three of which are red, and the 
remaining three are green. How many collection of four balls can be selected such that at least 
two blue balls are selected? Assume that balls of the same color are indistinguishable. 


Solution: The keywords “at least” mean we could have two, three, or four blue balls. There 
are 



ways to select four balls, with at least two of them being blue. A 

Hands-On Exercise 8.4.12 Jerry bought eight cans of Pepsi, seven cans of Sprite, three cans 
of Dr. Pepper, and six cans of Mountain Dew. He want to bring 10 cans to his pal’s house 
when they watch the basketball game tonight. Assuming the cans are distinguishable, say, with 
different expiration dates, how many selections can he make if he wants to bring 

(a) Exactly four cans of Pepsi? 

(b) At least four cans of Pepsi? 

(c) At most four cans of Pepsi? 

(d) Exactly three cans of Pepsi, and at most three cans of Sprite? 


A 

The proof of the next result uses what we call a combinatorial or counting argument. In 
general, a combinatorial argument does not rely on algebraic manipulation. Rather, it uses the 
combinatorial significance of the situations to solve the problem. 
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Theorem 8.4.3 Prove that Ylr=o (") = 2 n f or a ^ nonnegative integers n. 

An example of 
a combinatorial 
argument. 


Summary and Review 

• Use permutation if order matters, otherwise use combination. 

• The keywords arrangement, sequence, and order suggest using permutation. 

• The keywords selection, subset, and group suggest using combination. 

• It is best to start with a construction. Imagine you want to list all the possibilities, how 
would you get started? 

• We may need to use both permutation and combination, and very likely we may also need 
to use the addition and multiplication principles. 

Exercises 8.4 

1. If the Buffalo Bills and the Cleveland Browns have eight and six players, respectively, 
available for trading, in how many ways can they swap three players for three players? 

2. In the game of Mastermind, one player, the codemaker, selects a sequence of four colors 
(the “code”) selected from red, blue, green, white, black, and yellow. 

(a) How many different codes can be formed? 

(b) How many codes use four different colors? 

(c) How many codes use only one color? 

(d) How many codes use exactly two colors? 

(e) How many codes use exactly three colors? 

3. Becky likes to watch DVDs each evening. How many DVDs must she have if she is able 
to watch every evening for 24 consecutive evenings during her winter break? 

(a) A different subset of DVDs? 

(b) A different subset of three DVDs? 

4. Bridget has n friends from her bridge club. Every Thursday evening, she invites three 
friends to her home for a bridge game. She always sits in the north position, and she 
decides which friends are to sit in the east, south, and west positions. She is able to do 
this for 200 weeks without repeating a seating arrangement. What is the minimum value 
of n? 

5. Bridget has n friends from her bridge club. She is able to invite a different subset of three 
of them to her home every Thursday evening for 100 weeks. What is the minimum value 
of n? 

6. How many five-digit numbers can be formed from the digits 1, 2, 3, 4, 5, 6, 7? How many 
of them do not have repeated digits? 

7. The Mathematics Department of a small college has three full professors, seven associate 
professors, and four assistant professors. In how many ways can a four-member committee 
be formed under these restrictions: 

(a) There are no restrictions. 

(b) At least one full professor is selected. 

(c) The committee must contain a professor from each rank. 


Proof: Since (") counts the number of r-element subsets selected from an n-element set S, the 
summation on the left is the sum of the number of subsets of S of all possible cardinalities. In 
other words, this is the total number of subsets in S. We learned earlier that S has 2" subsets, 
which establishes the identity immediately. ■ 
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8. A department store manager receives from the company headquarters 12 football tickets 
to the same game (hence they can be regarded as “identical”). In how many ways can she 
distribute them to 20 employees if no one gets more than one ticket? What if the tickets 
are for 12 different games? 

9. A checkerboard has 64 distinct squares arranged into eight rows and eight columns. 

(a) In how many ways can eight identical checkers be placed on the board so that no two 
checkers can occupy the same row or the same column? 

(b) In how many ways can two identical red checkers and two identical black checkers be 
placed on the board so that no two checkers of the same color can occupy the same 
row or the same column? 

10. Determine the number of permutations of { A , B, C, D , E} that satisfy the following con- 
ditions: 

(a) A occupies the first position. 

(b) A occupies the first position, and B the second. 

(c) A appears before B. 

11. A binary string is a sequence of digits chosen from 0 and 1. How many binary strings of 
length 16 contain exactly seven Is? 

12. In how many ways can a nonempty subset of people be chosen from eight men and eight 
women so that every subset contains an equal number of men and women? 

13. A poker hand is a five-card selection chosen from a standard deck of 52 cards. How many 
poker hands satisfy the following conditions? 

(a) There are no restrictions. 

(b) The hand contains at least one card from each suit. 

(c) The hand contains exactly one pair (the other three cards all of different ranks). 

(d) The hand contains three of a rank (the other two cards all of different ranks). 

(e) The hand is a full house (three of one rank and a pair of another). 

(f) The hand is a straight (consecutive ranks, as in 5, 6, 7, 8, 9, but not all from the 
same suit). 

(g) The hand is a flush (all the same suit, but not a straight). 

(h) The hand is a straight flush (both straight and flush). 

14. A local pizza restaurant offers the following toppings on their cheese pizzas: extra cheese, 
pepperoni, mushrooms, green peppers, onions, sausage, ham, and anchovies. 

(a) How many kinds of pizzas can one order? 

(b) How many kinds of pizzas can one order with exactly three toppings? 

(c) How many kinds of vegetarian pizza (without pepperoni, sausage, or ham) can one 
order? 

8.5 The Binomial Theorem 

A binomial is a polynomial with exactly two terms. The binomial theorem gives a formula 
for expanding (x + y) n for any positive integer n. 

How do we expand a product of polynomials? We pick one term from the first polynomial, 
multiply by a term chosen from the second polynomial, and then multiply by a term selected 
from the third polynomial, and so forth. In the special case of ( x + y) n , we are selecting either 
x or y from each of the n binomials x + y to form a product. Some of these products will be 
identical, hence, we need to collect their coefficients. The expansion of ( x + y ) 3 is demonstrated 
below. 
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f 

~ f 

~ f ' 

I 

t 

I 

I 

I 


1 

1 

i 

(x + y) ■ 

1 (x + y) ■ 

1 (x + y) 


I 

i 


I 

4 


I 

4 


i 

4 


xxx = x 3 
xxy = x 2 y 
xyx = x 2 y 
xyy = xy 2 

yxx = x 2 y 
yxy = xy 2 
yyx = xy 2 

yyy = y 3 


We find 


There are four ways 
to write binomial 
theorem; they are all 
equivalent. 


Here is how to 
remember them. 


(x + y) 3 


(a ' + y)(x + y)(x + y) 

xxx + xxy + xyx + xyy + yxx + yxy + yyx + yyy 
x 3 + x 2 y + x 2 y + xy 2 + x 2 y + xy 2 + xy 2 + y 3 
x 3 + 3 x 2 y + 3 xy 2 + y 3 . 


What happens when we expand (x + y) n l 

If we select y from k copies of the (x + y) s, and x from the other n — k copies, their product 
will be x n ~ k y k . Therefore, in the expansion of (x + y) n , a typical term will be of the form 
x n-kyk , w i iere 0 < k < n. The question is, what is its coefficient in the expansion, after we 
collect like terms? This coefficient is the number of times the product x n ~ k y k appears when we 
multiply out (x + y) n in the way described above. It depends on which k copies of the (x + y) s 
we will choose y from. There are (?) choices, hence, the product x n ~ k y k appears (^) times. 
Thus, the coefficient is (^). For this reason, we also call (^) the binomial coefficients. 

Theorem 8.5.1 (Binomial Theorem) For any positive integer n, 

(x + y) n = 

fc=o ' ' 

Because of the symmetry in the formula, we can interchange x and y. In addition, we also 
have ()() = ( ” fc ) . Consequently, the binomial theorem can be written in three other forms: 


= t („"*)*"" v ’ 

k = 0 x 7 

(x + y) n = E(fc)*V“ fc , 

fc= 0 ^ ' 



You need not worry which one to use. They are all the same! This is how to remember these 
four different forms. In each term, the powers of x and y always add up to n. If the power of 
one of the two variables is k, where 0 < k < n, then the power of the other must be n — k, and 
we need to multiply the coefficient (?), which is the same as ( n 1 k ), to their product. 

When expanding (x + y) n , it may be helpful if you first lay out all the terms x n , x n ~ 1 y, 
x n ~ 2 y 2 , and so forth. Then you fill in with the binomial coefficients. For instance, to expand 
(x + y) 3 , we first list all the terms that we expect fo find: 


( x + y ) 3 = x 


2 , 2 i 3 

x V + xy + y . 


,3 



8.5 The Binomial Theorem 


245 


Next we fill in the binomial coefficients: 

(x + y) 3 = 


x 2 y 


xy 


y 3 - 


Finally, evaluate the binomial coefficients and simplify the result. 

(x + y) 3 = x 3 + 3 x 2 y + 3xy 2 + y 3 . 

In a similar way, we also find (x — y) 3 = x 3 — 3x 2 y + 3 xy 2 — y 3 . Note the similarity between the 
two expansions. 

Example 8.5.1 Compute (a: + y) 4 . 

Solution: Following the steps we outlined above, we find 

(x + y ) 4 = 


x V 


2 2 , 
x y + 


xy 


= x 4 + 4 x 3 y + 6a ,y 2 + 4xy 3 + y ■ 


Since (q) = (") = 1, the expansion always starts with x n and ends with y r ‘ 
Example 8.5.2 Compute (a; — y) 4 . 

Solution: We find 

{x - y) 4 = [x + {-y)} 4 


x4 + L x 3 (~y) + L * 2 (-y) 2 + o F(-y) 3 + , (-y) 


= x 4 — 4x 3 y + 6a ,2 y 2 — 4xy 3 + y 4 . 


Take note of the alternating signs in the expansion. This suggests that we could expand ( A—B) n 
the exact same way we would with (A + B ) n , except that the signs alternate. 

We can carry out the expansion by following these steps. First, list all the terms we expect 
to find 

{x + y) 4 = 

Next, fill in the signs: 

{x + y) 4 = _ 


x 3 y 


2 2 
x y 


xy 


y 


x 4 - 


x 3 y 


2 2 
x y 


xy 


x 4 — 


x 3 y 


2 2 
x y 


xy 3 + 


and then the binomial coefficients: 

(x + y) 4 = 

Finally, compute the binomial coefficients to finish the expansion. 

Example 8.5.3 Expand (2x — 3 y) 5 . 

Solution: The expansion yields 

( 2x ) 5 - (2a;) 4 (3y) + ff) (2x) 3 (3y) 2 - (f) (2xf(3 y) 3 + ( ° A ) (2a:) (3 y) 4 - (3 y) 


Therefore, (2x - 3 yf = 32a; 5 - 240 x 4 y + 720a : 3 y 2 - 1080x 2 y 3 + 810a :y 4 - 243 y 5 . 
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Hands-On Exercise 8.5.1 Use the binomial theorem to expand (3a: — 5 y) 4 . 


A 


Example 8.5.4 Find the coefficient of x 3 in the expansion of (1 + a:) 102 . 


Solution: Since 

( i+-) io 2 =e(T)^ 

the term containing x 3 is ( 192 )a: 3 . Therefore, the coefficient is ( 192 ). Depending on which form 
of the binomial theorem you use, you may end up with the term ( 1 g ° g 2 )a; 3 . Numerically, this gives 
us the same coefficient, because ( g 0 g 2 ) = ( 10 2 _ 9 g) = C” 2 )- A 

Example 8.5.5 What is the coefficient of t 4 in the expansion of (2 + 3t) 9 ? 


Solution: Since 


(2 + 3 t) 9 ^J2 

k—0 


we need k = 4. The coefficient is ( 9 )2 5 • 3 4 -. 


2 9 ~ k (3t) k , 


A 


Example 8.5.6 What is the coefficient of t 5 in the expansion of (3 — 2 1) 7 ? 

Solution: Since (3 — 2 1) 7 = Y^k=o Ck)3 7 ~ k (~ 2t) fe , we need k = 5, and the coefficient is (g)3 2 • 
(-2) 5 = -( 5 7 )3 2 -2 5 . " A 


Hands-On Exercise 8.5.2 What is the coefficient of t 5 in (1 + 3t) 8 ? 


A 


Hands-On Exercise 8.5.3 What is the coefficient of t 4 in the expansion of (2 — 5t) 9 ? 


A 


Example 8.5.7 What is the coefficient of t 6 in the expansion of (4 + 5t 2 ) 8 ? 

Solution: The general term in the expansion is ( 8 )4 8-fe (5 t 2 ) k = (*)4 8-fc • 5 k t 2k . Hence, we 
need k = 3, and the coefficient is (g)4 5 • 5 3 . A 
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Hands-On Exercise 8.5.4 What is the coefficient of t 9 in the expansion of (3 — 2f 3 ) 8 ? 


A 

The constant term in an expansion does not contain any variable. It can be interpreted as 
the term containing x°. 


Example 8.5.8 Find the constant term in the expansion of ( x 
Solution: The general term in the expansion is 

k 


,8 — k 


l-k 


2 k x 8 ~ 2k . 


We need 8 — 2k = 0 or k = 4. Therefore, the coefficient is ( 8 ) 2 4 . 


Hands-On Exercise 8.5.5 Find the constant term in the expansion of the two expressions 

3\ 9 / 3 

x H — and \2x 


A 


Example 8.5.9 Determine the coefficient of x‘ in the expansion of (1 + x + x 2 )(l + x) 10 . 
Solution: Expand (1 + x + x 2 )(l + x) 10 as follows: 


10 /m\ 

(l+x + x 2 )(l + a:) 10 = (l + x + x 2 )J2 

k = 0 ^ ' 


10 


= E 


k—0 


10 


10 

E 

k = 0 


10 


„k+l 


10 

E 

k — 0 


10 


„fe+ 2 


So the coefficient of x 7 is (y 0 ) + (g°) + (g 0 ). 


Hands-On Exercise 8.5.6 Find the coefficient of x 8 in the expansion of (1— 2x+3a: 2 )(l-|-2a;) 12 . 


A 
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Another example 
of a combinatorial 
argument. 


To compute the binomial coefficients quickly, one may use the Pascal triangle , in which 
the nth row (n > 0) consists of the binomial coefficients (^) , where 0 < k < n: 

1 

1 1 

12 1 
13 3 1 

1 4 6 4 1 

15 10 10 5 1 

1 6 15 20 15 6 1 

Constructing the Pascal triangle is easy. We generate the rows one at a time. The extreme 
ends are always 1. Each of the interior entries is the sum of the two entries right above it in the 
preceding row. For instance, the next row (for n = 7) should be 

1 7 21 35 35 21 7 1 

Such computations produce the right binomial coefficients, because of the next result. 

Theorem 8.5.2 (Pascal’s Identity) For all integers n and k satisfying 1 < k < n, 



Proof 1: (Analytic Proof) It follows from the definition of binomial coefficients that 

A ( n ~ !\ = (n- 1)! (»-!)! 

l) V k ) (k-l)\{n-k)\ k\{n-k-l)\ 

(n-iy. f 1 1\ 

( k — 1)! (n — k — 1)! \n — k k) 

(n — 1)! n 

( k — 1)! (n — k — 1)! k(n — k) 
n\ 

k\ (n — k)\ 

This completes the proof. ■ 

Proof 2: (Combinatorial Proof) Let A be an n-element set. Then (?) counts the number 
of fc-element subsets of A. These subsets can be classified according to whether they contain a 
fixed element, say x. If a subset contains x, then the other k — 1 elements must be selected from 
the remaining n — 1 elements of A. Otherwise, if the subset does not contain x, then all its k 
elements must be selected from the other n — 1 elements of A. The numbers of these two kinds 
of subsets are given by and (" f 1 ), respectively. The theorem now follows immediately by 

applying the addition principle. ■ 

Hands-On Exercise 8.5.7 Determine the 8th and the 9th rows in the Pascal’s triangle. 



A 


Example 8.5.10 Use the Pascal’s triangle to expand 
(a) (C - Df (b) (2 A + 5 Bf (c) (3 C - 4 B) 4 





8.5 The Binomial Theorem 


249 


Solution: Draw the values of ((() from the Pascal triangle directly. The answers are: 

(a) (C - D) 5 = C 5 - hC 4 D + 10 C 3 D 2 - 10 C 2 D 3 + 5 CD 4 - D 5 . 

(b) (2a + 5 B) 3 = 8H 3 + 60 A 2 B + 150 AB 2 + 125 B 3 . 

(c) (3 C - 4 B) 4 = 81C 4 - 432C 3 J5 + 864C 2 B 2 - 768 CB 3 + 256B 4 . A 


Many interesting results can be derived from the binomial theorem. 

Example 8.5.11 Setting x = y = 1, we obtain a simple (analytic) proof of the familiar identity 

2" = £;.<, Cl * 

Example 8.5.12 Letting x = 1 and y = — 1 yields 0 = zLfc=o(”l) fe (fe)- We can rewl 'it e it as 


Combinatorially, this means the number of subsets of even cardinalities equals the number of 
subsets of odd cardinalities. A 


Summary and Review 

• The binomial theorem can be expressed in four different but equivalent forms. 

• The expansion of ( x + y) n starts with x n , then we decrease the exponent in x by one, 
meanwhile increase the exponent of y by one, and repeat this until we have y n . 

• The next few terms are therefore x n ~ 1 y, x n ~ 2 y 2 , etc., which end with y n . 

• In general, the sum of exponents in x and y is always n. Hence, the general term is x k y n ~ k , 
whose coefficient is (^) . 

• The expansion of (x + y) n and (x — y) n look almost identical, except that the signs in 
(x — y) n alternate. 


Exercises 8.5 


1. Use binomial theorem to expand the following expressions: 


(a) (x + yf (b) (s - tf 

2. Find the coefficient of 


(c) (a + 3b) 4 


(a) x 14 y 3 in (a; + y) 14 
(c) x 4 y 3 in (3x + 2 y) 7 


(b) x 4 y 7 in (2x — y) 11 

(d) x 5 in (1 — x + a; 2 )(l + x) 7 


3. Find the constant term in the expansion of 


(a) (x+1) 

(c) ( 3i2 - 7 ?) 


(b) 

(cl) (l-x 2 +x 3 ) 0* 2 - 


4. 



3” for any positive integer n. 


n 

5. Let n be a positive integer. Evaluate 

k = o 


r k for any real number r. 
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6. Find a closed form for the summation k 


k - o 


Hint : Differentiate (1 + x) n with respect to x. 

7. The objective of this problem is to derive a formula for k 2 . 

(a) Use induction to show that 


Mi 


k = 1 


i) 


for any positive integer n. 

(b) Use induction to show that 


E 

*;= i 


k\ n(n + 1 )(n — 1) 

2 / = “ 


3! 


for any positive integer n. 

Hint: Note that Q) = 0. 

(c) Find the integers a and b such that 


k 2 = a 


Hint: Compare coefficients, 

(d) From part (c), we obtain 




E fc2 = «E 

k— 1 k= 1 ^ / k — 1 

Apply the results from parts (a) and (b) to derive a formula for J2k=i k 2 ■ 

8. The objective of this problem is to derive a formula for ^ 3 - 

(a) Use induction to show that 


E 

k = 1 


n(n + l)(n — l)(n — 2) 


4! 


for any positive integer n. 

Hint: Note that Q) = 0. 

(b) Find the integers a, 6, and c such that 

k 3 = aC:] +b 


+ c 


Hint: Compare coefficients. 

(c) Apply the results from parts (a) and (b) to derive a formula for J2k=i ^ 3 • 



Appendix A 


Solutions to Hands-On 


Exercises 


Section 1.4 

1. Two proofs are given below, one uses direct expansion, the other uses factorization. 
Solution 1: Expanding the two sides separately, we find 
fc(fc + l)(fc + 2) 


+ (fc + l)(fc + 2) — 


fc 3 + 3 fc 2 + 2 k 2 07 n 

h fc + 3fc + 2 

O 

k 3 + 3 k 2 + 2k + 3(fc 2 + 3 k + 2) 
3 

fc 3 + 6 k 2 + llfc + 6 


and 


(fc + l)(fc + 2 ){k + 3) ( k 2 + 3 k + 2 )(fc + 2) k 3 + 6 fc 2 + llfc + 6 


which establish the identity. 
Solution 2: Since 

fc(fc + l)(fc + 2) 


the identity always holds. 


H - (fc + l)(Ai + 2) — (/c + l)(Ai + 2) + 1 

(fc + l)(fc + 2)(fc + 3) 
3 : 


Section 2.1 


1. Any example of the form 


If 



then 


. Therefore, if 

not . . . 

then 

not . . . 


will work. 

2. (a) We do not know which “he” the sentence is referring to. 

(b) We do not know the values of x and y. 

(c) While the equation is true if A and B are numbers, it is not always true if A and B 
are matrices. 

3. (a) x is an integer less than or equal to 7. 

(b) We cannot factor 144 into a product of prime numbers. 

(c) The number 64 is not a perfect square. 
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Section 2.2 

1. £ is rational and y is rational; (x € Q) A (y € Q). 

2. (a) Since “\/30 < 5” is false, and “v^ > 7” is true, the statement “(\/30 < 5)A(v / 30 > 7)” 

is false. 

(b) Since “%/.30 > 5” is true and “-\/30 < 7” is false, the statement “(-\/30 > 5) V (-\/30 < 7)” 
is true. 

3. (5 < x) A (x < 8). 

4. The statement “0 > x > 1” means “(0 > x)A(x > 1).” Since no number can be less than or 

equal to 0 and greater than or equal to 1 simultaneously , the statement “(0 > x) A(x > 1)” 

is always false. 

Section 2.3 

1. (a) one (b) two (c) none 

2. x>y>0=>x 2 >y 2 . 

3. (a) False, because we could have x = 3, then “(x — 2) (x — 3) = 0” is true but “x = 2” is 
false. This makes the implication false. 

(b) True, because if “x = 2” is true, “(x — 2) (x — 3) = 0” would be true as well. Thus, the 
implication is true. 

4. (a) p: The figure PQRS is a square, 

q: The figure PQRS is a parallelogram. 

(b) p: The number a; is a prime number, 
q: The number x is an integer. 

(c) p: The function f(x) is a polynomial, 
q: The function f(x) is differentiable. 

5. converse: if y fp is irrational, then p is prime 

inverse: if p is composite, then ^Jp is rational 

contrapositive: if ^Jp is rational, then p is composite 

6. (a) p : x > 1; q : x 2 > 1. 

(b) p : x 2 > 1; q : x > 1. 

Section 2.4 

1. The completed statement is “n is odd -£=> n = 2k + 1 for some integer fc.” 

Proof : Assume n is odd, then we can write n = 2k + 1 for some integer k. We find 
n 2 = (2k + l) 2 = 4 k 2 + 4k+l = 2(2 k 2 + 2k) + 1, 
where 2 k 2 + 2k is an integer. Hence, n 2 is also odd. 
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2. The statement “p => q A r” means “p => (q A r).” Below is its truth table. 


p 

q 

r 

q A r 

p => (pAr) 

T 

T 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

F 

F 

T 

F 

F 

F 

F 

F 

T 

T 

T 

T 

F 

T 

F 

F 

T 

F 

F 

T 

F 

T 

F 

F 

F 

F 

T 


3. We can write “(pA g) O (pV g).” To construct its truth table, we need to evaluate each 
component one at a time: 


P 

q 

pAq 

P 

q 

p\J q 

(pAg)»(pV q) 

T 

T 

T 

F 

F 

F 

F 

T 

F 

F 

F 

T 

T 

F 

F 

T 

F 

T 

F 

T 

F 

F 

F 

F 

T 

T 

T 

F 


Section 2.5 

1. In general, if we have n statements, we need 2" rows in the truth table. 


V 

9 

r 

pAq 

(P A q) => r 

r 

P 

9 

P V q 

r => (pV q) 

KP A q) => r) => [r => (p V g)] 

T 

T 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

T 

F 

T 

F 

T 

F 

F 

F 

F 

T 

T 

F 

T 

F 

T 

F 

F 

T 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

T 

T 

T 

T 

F 

T 

T 

F 

T 

F 

T 

F 

T 

T 

T 

F 

T 

F 

F 

T 

T 

T 

F 

T 

T 

T 

F 

F 

T 

F 

T 

F 

T 

T 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

T 

T 

T 

T 


P 

q 

q 

q 

P 

q^p 

(c) 

P 

q 

PA q 

P 

q 

pVg 

pVg 

T 

T 

T 

F 

F 

T 


T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

F 

F 


T 

F 

F 

F 

T 

T 

F 

F 

T 

T 

F 

T 

T 


F 

T 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

T 


F 

F 

F 

T 

T 

T 

F 


P 

P 

pVp 

(d) 

P 

q 

p^q 

q 


(p => q) A (g => p) 

T 

T 

T 


T 

T 

T 

T 

T 

T 

F 

F 

F 


T 

F 

F 

F 

T 

F 




F 

T 

F 

T 

F 

F 




F 

F 

T 

T 

T 

T 


3. We need to compare the truth values of the three formulas: 

p^q, (pVg)ApAg, and (pAg) V (pAg). 
The truth table for comparing them is depicted below. 


p 

9 

p-q 

P Vg 

pAq 

P A q 

{p V q) A p A q 

P 

9 

pAq 

pAq 

(P A q) V (p A q) 

T 

T 

F 

T 

T 

F 

F 

F 

F 

F 

F 

F 

T 

F 

T 

T 

F 

T 

T 

F 

T 

T 

F 

T 

F 

T 

T 

T 

F 

T 

T 

T 

F 

F 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

T 

F 

F 

F 
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4. The statement “0 > x > 1” means “(0 > x) A (x > 1),” which is always false because no 
such x exists. 

5. We find 

(p V q) A (r V s) = [p A (r V s)] V [g A (r V s)] 

= (p A r) V (p A s) V (q A r) V (q A s). 


Section 2.6 

1. (a) false (b) true (c) true 

2. The first three twin primes are 3 and 5, 5 and 7, and 11 and 13. 

3. (a) False, a counterexample is x = 2. 

(b) True. 

(c) False, because an even number is defined as an integral multiple of 2. 

(d) True, see (c). 

(e) False, x = y/2 provides a counterexample. 

4. False, because 0 2 = 0. 

5. True, because we can pick y = 0, then regardless of what x is, we always have xy = 0 < 1. 

6. (a) There exists a prime number x such that the number x + 1 is prime. 

(b) There exists a prime number x > 2 such that the number x + 1 is prime. 

(c) For any integer k. the number 2 k + 1 is odd. 

(d) There exists an integer k such that 2k is odd. 

(e) There exists a number x such that x 2 is an integer and x is not an integer. 

7. One solution is: “There is a Discrete Mathematics student who has not taken Calculus I 
and Calculus II.” Because of De Morgan’s laws, we can also state the negation as “There is 
a Discrete Mathematics student who has not taken Calculus I or has not taken Calculus II.” 


Section 3.1 

1. The distance between a and | a + | b is 

(1 2 \ 2 2 2., 

\3 3 / 3 3 3 ^ 

The distance between | a + | b and b is 

b— f ^ a + \ b\ = \ b — - a = ^ (b — a). 

\3 3^ 3 3 3 ' ’ 

Since | (6 — a) > | (b — a), the point | a + § b is closer to b than to a. 

2. 6 = 2 • 3, 40 = 2 3 • 5, 32 = 2 5 • 1, and 15 = 2° • 15. 

3. The five consecutive integers 722, 723, 724, 725, and 726 are composite. 

4. Since a and b are rational numbers, we can write a = — and b = | for some integers m, 
n, r, and s, where n,s^0. Then the midpoint of the interval [a, b\ is 

a + b 1 /m r\ ms + nr 

2 2 V n + s) 2 ns ’ 

where ms+nr and 2 ns are integers, and 2 ns ^ 0. Hence, is a rational number between 
a and b. 
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5. Using the same argument in Hands-On Exercise 3.1.4, we see that | a + | b is rational, 
and we have learned from Hands-On Exercise 3.1.1 that it is closer to b than to a. 

6. Let g(x) — 1 + xcosx. Noting that g(— 7r) = 1 — 7r < 0 and 5(0) = 1 > 0, we conclude 
that the equation g(x) = 0 has a real solution between — 7r and 0. 


Section 3.2 

1. Assume n is odd, we can write n — 2k + 1 for some integer k. Then 

n 3 = (2k + l) 3 = 8 k 3 + 12 k 2 + 6fc + 1 = 2(4 k 3 + 6 fc 2 + 2k) + 1, 
where 4fc 3 + 6fc 2 + 2k is an integer. Thus, n 3 is odd. 

2. Assume x 3 + 6a: 2 + 12a; + 8 = 0. Since 

a; 3 + 6a; 2 + 12a; + 8 = (x + 2) 3 , 
we must have (x + 2 ) 3 = 0. It follows that x = —2. 

3. There are two cases. 

• Case 1: If n is even, then n = 2 q for some integer q. so that 

n 3 + n = (2g) 3 + 2 q = 8 q 3 + 2 q = 2(4 q 3 + q), 
where 4 q 3 + q is an integer. 

• Case 2: If n is odd, then n = 2q + 1 for some integer q, so that 

n 3 +n=(2q + l) 3 + (2 q + 1) = 8 q 3 + 12 q 2 + 8q + 2 = 2(4 q 3 + 6 q 2 +4 q + 1), 
where 4 q 3 + 6 q 2 + 4g + 1 is an integer. 

In both cases, we have proved that n 3 + n is even. 

Section 3.3 

1. Assume a; is a real number such that x ^ —5 and x ^ 7, then x + 5 7^ 0 and x — 7 ^ 0. 
Since 2a; 2 + 5 can never be zero, we find 

(2a: 2 + 3)(a; + 5) (a; — 7) ^0. 

Therefore, if (2a: 2 + 3)(x + 5) (a: — 7) = 0, then either x = —5, or x = 7. 

2. We shall prove the contrapositive of the given statement. Let x and y be real numbers 
such that xy = 0. Then either x = 0 or y = 0. Therefore, if x 7^ 0 and y 7^ 0, then xy ^ 0. 

3. Assume x 2 > 49, we want to prove that |aa| > 7. Suppose, on the contrary, \x\ < 7. This 
means —7 < x < 7. We need to study two cases. 

• If —7 < x < 0, we find 0 < x 2 < 49. 

• If 0 < x < 7, we find 0 < a; 2 < 49. 

In both cases, we have x 2 < 49. This contradicts the given assumption that x 2 > 49. 
Therefore, we must have |a;| > 7. 
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4. Suppose there exist some positive numbers x and y such that yjx + y = y/x + y/y, then 

x + y= (y/x + y/y) 2 = x + 2y/xy + y, 


implying that 


0 = 2 y/xy, 


which is possible only when xy = 0. But both x and y are positive, hence, xy / 0. This 
contradiction shows that y/x + y / y/x + y/y for all positive numbers x and y. 


5. Suppose y/3 is rational, then we can write 


v / 3 = 


m 

n 


for some positive integers to and n such that m and n do not share any common divisor 
except 1 (hence, / is in its simplest term). Squaring both sides and cross-multiplying 
gives 

3 n 2 = m 2 . 

Thus 3 divides m 2 , consequently 3 must also divide to. Then we can write to = 3s for 
some integer s. The equation above becomes 


3n 2 = to 2 = (3s) 2 = 9s 2 . 


Hence, 

n 2 = 3s 2 , 

which implies that 3 divides n 2 , thus 3 also divides n. We have proved that both to and 
n are divisible by 3. This contradicts the assumption that to and n do not share any 
common divisor. Therefore, y/3 must be irrational. 

6. (=>) If n is odd, we can write n = 2k + 1 for some integer k. Then 

n 2 = (2k + l) 2 = 4 k 2 + Ak + 1 = 2(2 k 2 + 2k) + 1, 

where 2 k 2 + 2k is an integer. Hence, n 2 is odd. 

(4=) We shall prove its contrapositive: if n is even, then n 2 is even. If n is even, we can 
write n = 2k for some integer k. Then 

n 2 = (2k) 2 =4 k 2 = 2-2 k 2 , 

where 2 k 2 is an integer, which means n 2 is even. 


Section 3.4 


1. We proceed by induction on n. When n = 1, the left-hand side reduces to 1 • 2 = 2, and 
the right-hand side becomes = 2. Hence, the identity holds when n = 1. Assume the 
identity holds when n = k for some integer k > 1; that is, assume 


1 • 2 + 2 • 3 + 3 • 4 + • • • + k(k + 1) 


k(k + l)(k + 2) 
3 


for some integer k > 1. We want to show that it also holds when n = k + 1; that is, we 
want to show that 


(k + 1 )(k + 2 )(k + 3) 


l-2 + 2- 3 + 3- 4 + -- - + (fc + 1 )(k + 2) 


3 
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It follows from the inductive hypothesis that 

1 • 2 + 2- 3+ ••• + (k+ 1)0 + 2) = 1 • 2 + 2 • 3 H + k(k + 1) + (fe + l)(fc + 2) 

k(k + l)(fc + 2) 

3 


+ (k + l)(fc + 2) 


— (k + 1 )(k + 2) ( — + 1 


— (k + l)(fc + 2) 


k 
3 

k + 3 


This completes the induction. 

2. We proceed by induction on n. When n = 1, the left-hand side reduces to 1 • 2 • 3 = 6, and 
the right-hand side reduces to 1,2 4 3 ’ 4 = 6. Hence, the identity holds when n = 1. Assume 
it holds when n = k for some integer k > 1; that is, assume 


+ !)(* + 1 2 ) = 


/c(fc + 1 )(k + 2 )(k + 3) 


i=l 


for some integer k > 1. We want to show that it also holds when n = k + 1; that is, we 
want to show that 

J2 i(i + l)(z + 2) = (k + 1)(fc + 2)(k + 3)(k + 4) . 


It follows from the inductive hypothesis that 


fc+i 


+ 1)(* + 2) — E i{i + 1)(* + 2) 1 + (fc + 1 )(fc + 2)(/e + 3) 




Vi=l 


k{k + 1 )(k + 2 )(fc + 3) 


+ (fc + l)(fc + 2 )(fc + 3) 


(A: + l)(fc + 2)(fc + 3) ( — + 1 


— (/c + l)(fc + 2)(fc + 3) 


k 
4 

k + 4 


This completes the induction. 

3. We proceed by induction on n. When n = 1, the left-hand side reduces to 1 + 4 = 5, and 
the right-hand side becomes |(4 2 — 1) = 5. Hence, the identity holds when n = 1. Assume 
it holds when n = k for some integer k > 1; that is, assume that 

1 + 4 + 4 2 -| b4 fc = - (4 fc+1 - 1) 

3 

for some integer fc > 1. We want to show that it also holds when n = k + 1; that is, we 
want to show that 

1 + 4 + 4 2 H h 4 fc+1 = l (4 fe+2 - 1). 

o 

It follows from the inductive hypothesis that 

1 + 4 + 4 2 H b 4 fe+1 = 1 + 4 + 4 2 H b 4 k + 4 fe+1 

= | (4 fe+1 - 1) + 4 fc+1 

= i (4 fe+1 - 1 + 3 • 4 k+l ) 

= i (4 • 4 fe+1 - 1) 

= 4 (4 fc + 2 — 1). 


This completes the induction. 
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Section 3.5 

1. We use induction to prove the claim. Note that n 2 + 3n + 2 = 6 when n= 1, so the claim 
is true. Assume it is true when n = k for some integer k > 1, so we can write 

k 2 A 3 k A 2 = 2 g 

for some integer q. We want to show that the claim is still true when n = k + 1, that is, 

( k A l) 2 A 3(fc A 1) A 2 = 2 Q 

for some integer Q. We find 

(fc + l) 2 A 3(fc A 1) A 2 = k 2 A 5k A 6 

= ( k 2 + 3 k A 2) + 2k A 4 

= 2 q A 2 k A 4 
= 2(q A k A 2), 

where q A k A 2 is an integer. Thus, the claim is still true when n = k A 1, thereby 
completing the induction. 

2. Proceed by induction on n. Since 1 < 2, the inequality is valid when n = 1. Assume it is 
valid when n = k for some integer k > 1; that is, assume 

k< 2 k 

for some integer k > 1. We want to show that 

k Al < 2 fc+1 . 

Notice that for k > 1, we have 1 < 2 fe . Hence, it follows from the inductive hypothesis 
that 

k A 1 < 2 fc A 1 < 2 fc A 2 fc = 2 • 2 fc = 2 fc+1 . 

This completes the induction and the proof of the given inequality. 

3. Proceed by induction on n. When n = 0, the LHS of the identity reduces to 1, and the 
RHS of the identity becomes 3(l — |) = 3 • | = 1. Thus, the identity holds when n = 0. 
Assume it holds when n = k for some integer k > 0. That is, assume 


, 2 4 

1+ 3 + 9 + ' 


A 


= 3 




fc+i' 


for some integer k > 0. We want to show that it also holds when n = k A 1. That is, we 
want to show that 


1 2 4 

1+ 3 + 9 


1) 


k + 1 


= 3 


1- “ 


k + 2 


According to the inductive hypothesis, 

fe+i 


, 2 4 

1+ 3 + 9 


I) 


1 2 4 

1+ 3 + 9 


= 3 


= 3 


1 - 


1 - 


fe+T 


I) 


k+ 1 


2\ fe+1 

3/ 


fc+1 


fe+1' 
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This completes the induction and the proof of the given identity. 

Section 3.6 

1. Proceed by induction on n. When n = 1,2, the proposed formula for c n says C\ = 
5 • 3 — 4 • 2 = 7, and C 2 = 5- 9 — 4-4 = 29. They agree with the given initial values, so the 
formula holds for n = 1,2. Assume the formula is valid for n = 1,2 , ,k for some integer 
k > 2. In particular, assume 

c k = 5-3 fc -4-2 fe , and Cfc _i = 5 • - 4 • 2 k ~ x . 

We want to show that the formula still works when n = k + 1. In other words, we want to 
show that 

c k+1 = 5 • 3 fc+1 — 4 • 2 k+1 . 

Using the recurrence relation and the inductive hypothesis, we find 

Ck-\-\ — 6cfc_i 

= 5(5 • 3 fc - 4 • 2 fc ) - 6(5 • - 4 • 2 k ~ 1 ) 

= 25 • 3 fc — 20 • 2 k — 30 • 3 fc_1 + 24 • 2 fc_1 

= 25 • 3 fc — 20 • 2 fc — 10 • 3 • 3 fe_1 + 12 • 2 • 2 fc " 1 

= 25 • 3 fc — 20 • 2 k - 10 • 3 fc + 12 • 2 k 

= 15 • 3 fc - 8 • 2 k 

= 5 • 3 • 3 fc — 4 • 2 • 2 fc 
= 5 • 3 fc+1 — 4 • 2 fc+1 , 

which is what we want to establish. This completes the induction, and hence, the claim 
that b n = 2 n + 3™. 

2. Proceed by induction on n. The claim is true for n = 2,3, because 

2 = 2 - 1 + 3- 0, 

3 = 2 • 0 + 3 • 1. 

Assume the claim holds when n = 2, 3, . . . , k for some integer k > 3. In particular, since 
k — 1 > 2, we may assume that 

k — 1 = 2x + 3y 

for some nonnegative integers x and y. We want to show that the claim is still true when 
n = k + 1. We find 

k + 1 = (k — 1) + 2 
= (2x + 3 y') T 2 

= 2(x + 1) + 3y, 

where x+\ and y are nonnegative integers. Therefore, the claim is still true when n = k+ 1. 
This completes the induction. 
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Section 4.1 

1. {-4, —3, -2, -1, 0, 1, 2, 3, 4}, and {1, 2, 3, 4}. 

2. {1,4,9,16}. 

3. {-..,-5, -3, -1,1, 3, 5,...}. 

4. {...,-9, -6,-3, 0,3, 6, 9,...}. 

5. Only {x £ R | 1 < x < 7} can be represented by the interval notation (1,7), because we 
have to include all the real numbers between 1 and 7. 

6. Because [2,7] = {i € K | 2 < i < 7} includes decimal numbers and integers, but 
{2, 3, 4, 5, 6, 7} contains only integers. 

7. False, because the interval (—2, 3) contains decimal numbers as well as integers, but the 
set { — 1, 0, 1, 2} contains only integers. 

8. Z-. 

9. The notation [7, 7] means {x € R. | 7 < x < 7}. Since equality is allowed, this set contains 
only one number, namely, the number 7. In other words, [7, 7] = {7}. But the sets (7, 7), 
(7, 7] and [7, 7) are empty. 

10. Both sets have two elements. The elements of {0, {1}} are 0 and {1}, one of them is an 
integer, the other is a set. The elements of {{0}, {1}} are the two sets {0} and {1}. 

11. (a) 0 (b) the set is infinite (c) 1 

12. It is incorrect to say |0| = 0 because |0| is a number (its value is 0), but 0 is a set, they 
are incompatible. 

Section 4.2 

1. (a) true (b) true 

2. False, because 3 € [3,4) but 3 ^ (3,4). 

3. Since (3, 4) consists of numbers strictly between 3 and 4, every number we can find in 
(3,4) also appears as a member of [3,4]. However, the interval [3,4] also contains the two 
numbers 3 and 4, which are not members of the interval (3,4). Therefore, it is true that 
(3,4) c [3,4]. Likewise, we also have (3,4) c (3,4]. 

4. (a) According to Theorem 4.2.2, the empty set is the subset of any set, including {0}. 
Thus, the statement is true. 

(b) For S C T, every element of S must be an element of T as well. Here, the set {1} has 
only one element: the number 1, which is also an element of |l,{l,2}}. Therefore, the 
statement is true. 

(c) This time, 1 does not appear in {{1},{1,2}} as an element. Notice that {{1},{1,2}} 
has two elements, both of which are sets, namely, {1} and {1, 2}. Therefore, the statement 
is false. It would have been true if it were {1} € {{1}, {1, 2}}. 

5. The completed table is listed below. 


size 

subset 

0 

0 

1 

{1},{2},{3},{4} 

2 

{1,2}, {1,3}, {1,4}, {2, 3}, {2, 4}, {3, 4} 

3 

{1,2, 3}, {1,2, 4}, {1,3, 4}, {2, 3, 4} 

4 

{1,2, 3, 4} 
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The final answer is 

p({l, 2, 3,4}) = {0, {1}, {2}, {3}, {4}, {1, 2}, {1, 3}, {1,4}, {2, 3}, {2, 4}, {3, 4}, 

{1,2, 3}, {1,2, 4}, {1,3, 4}, {2, 3, 4}, {1,2, 3, 4}}. 

6. The set 0 has no element, but the set {0} has one element (namely, the empty set). In 
terms of cardinality, |0| = 0, and |{0}| = 1. Yes, it is true that p(0) = {0}. 

7. There are 2 3 = 8 elements in p({a,/3, 7}). They are 

0, {<*}, {£}, {7}, {a,/?}, {a, 7}, {Ay}, and { a , A 7l- 

8. Since |0| = 0, the power set p(0) has only 2° = 1 element, which is 0 itself. Therefore, 

p(0) = {0}- 

9. Yes, because |A| is a number, and 2' A ' does equal to |p(A)|. The notation 2 A is illegal 
because A is a set, hence, it does not make much sense to raise 2 to a power that is not a 
number. 

Section 4.3 

1. AnB = {John}, AUB = {John, Mary, Dave, Larry, Lucy}, A — B = {Mary, Dave}, 
B — A = {Larry, Lucy}, A = {Lucy, Peter, Larry, }, B = {John, Mary, Dave}. 

2 . 0 . 

3. (a) Because { — 1, —2, —3, . . .} and {1, 2, 3, . . .} are sets, but 0 is not, we cannot form their 
set union. To fix it, we should write Z = {—1, —2, —3, . . .} U {0} U {1, 2, 3, . . .}. 

(b) This is worse than (a): all three components are not sets! Of course, it does not make 
much sense to take the union of things that are not even sets. To fix it, we need to insert 
curly braces (set brackets) as in (a). 

(c) Same problem as in (b), plus a wrong notation for set union. To fix it, insert curly 
braces and change the symbol + to U. 

(d) Same as (a). 

4. [-1,3) and (0,3). 

5. Solution 1 : Let x £ A n (B U C). Then xei, and x £ B U C. We know that x £ BL)C 
implies that x £ B or x £= C. So we have 

(i) x £ A and x £ B, or 

(ii) x £ A and x £ C; 

equivalently, 

(i) x £ An B, or 

(ii) x £ An C. 

Thus, x £ (inB)U(Jn C). We have proved that An(BnC) C (A n B) U (A n C). 

Now let x £ (An B) U (An C). Then x £ A n B or x £ AnC. From the definition of 
intersection, we find 

(i) x £ A and x £ B, or 

(ii) x £ A and x £ C. 

Both conditions require a; £ A, so we can rewrite them as 
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(i) x £ A, and 

(ii) x £ B or x £ C; 

equivalently, 

(i) x £ A, or 

(ii) x € B U C. 

Thus, x £ An (B U C). This proves that (An B) U (An C) C An (BU C). Together with 
A n (B U C) C (A n B) U (A n C), we conclude that A n (B U C) = (A n B) U (A n C). 

Solution 2: We note that 

x £ An (B n C) ^ x £ A r\ x £ (B n C) 

^x£A/\(x£B\Jx£C) 

(x £ A A x £ -£?) V (x £ A A x £ C ) 

(x £ A n B) V (x £ A n C) 
ox£(AnB)u(AnC) 

it follows that A n (B U C) = (AnB)U(An C). 

6. Assume A C B and ACC. We want to show that A C B n C. To achieve this goal, let 
x £ A. Since A C B, we also have x £ B. Likewise ACC implies that x £ C. Now x £ B 
and x £ C together imply that, according to the definition of set intersection, x £ B n C. 
We have proved that x £ A implies that x £ B D C; it follows that A C B D C. 

Section 4.4 

1. Ax B = {(a,r), (a,s), (a,t), ( b,r ), (b,s), ( b,t ), (c,r), (c,s), (c,t), (d,r), ( d,s ), (d,t)}, 

B x A = {(r, a), (r, b ), (r, c), (r, d), (s, a), (s, b), ( s , c), (s, d), (t, a), (t, 6), (t, c), (t, d)}, 

B x B = {(r, r), (r, s), (r, t), (s, r), (s, s), (s, t), ( t , r), (t, s), (t, t)}. 

2. {(a, 0), (a, {d}), (6,0), (6, {d}), (c,0), (c, {d})}. 

3. {(or, y) | 1 < x < 3, 2 < y < 4}. 

4- {(l,a,r),(l,a,s),(l,a,t), (l,6,r), (1,M),(1,M), 

(2, a, r), (2, a, s), (2, a, t), (2, 6, r), (2, 6, s), (2, b, t)}. 

5. {((l,a),r), ((1, a), s) , ((l,a),t), ((l,6),r), ((l,6),s), ((1 ,b),t), 

((2,a),r), ((2, a), s) , ((2,a),f), ((2,6),r), ((2 ,b),s), ((2 ,b),t)}. 

Section 4.5 

1. U ?=1 B i = [0, 2 n), and D?=i ^ = [0, 2). 

2- U£i = [0, oo), and fl^i B i = [0, 2). 

3- U=i Ci = [0, 1), and i Q = {0}. 

4- \JZi E i = (-oo, 2), and fl~i ^ = [-1, !]• 

5. U=i^ = N, andn~!^ = 0. 


(defn. of intersection) 
(defn. of union) 
(distributive law) 
(defn. of intersection) 
(defn. of union) 
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6. We find 

(J Ai = A 1 U A 4 U A 5 = {1, 4, 23} U {5, 17, 22} U {3, 6, 23} = {1, 3, 4, 5, 6, 17, 22, 23}, 

ieJ 

and 

Pi Ai = Ai n A 4 n A. 5 = {1,4, 23} n {5, 17, 22 } n {3, 6, 23} = 0. 

ieJ 

7. For I = {Mary, Joe, Lucy}, we have 

U = ^Mary U A Joe U A Lucy = {7, 11, 23} U {3, 6, 9} U {3, 6, 23} = {3, 6, 7, 9, 11, 23}. 

iei 

These will be the numbers on their Lotto tickets if Mary, Joe, and Lucy pool their money 
together. 

8. I leave them to you as exercises. 

9. The union (J ieJ A; represents the set of people who is friend to at least one student in I. 
The intersection A t represents the set of people who knows everyone in /. 

Section 5.1 

1. The subset (0, 1) does not have a smallest element. Thus [0, 1] is not well-ordered. 

Section 5.2 

1. (a) 18, 2 (b) —19, 5 (c) -25, 11 


b 

a 

b div a 

b mod a 

234 

15 

22 

4 

234 

-15 

-22 

4 

-234 

15 

-23 

11 

-234 

-15 

23 

11 


3. llg + 4, 2. 

4. Thursday. 


Section 5.3 

1.35 = 5-7, 35 = 8-4 + 3, 35 = 25-1 + 10, 14 = 7-2, -14 = 2 • (-7), 

14 = 14-1. 

2. When an odd integer is divided by 2, the remainder is 1. Hence, we have 

• If n is even, then n = 2 q for some integer q. 

• If n is odd, then n = 2q + 1 for some integer q. 

3. If n is not divisible by 3, then n = 3q + 1 or n = 3<? + 2 for some integer q. 

4. 1, 2, 3, 4, 6, 11, 12, 22, 33, 44, and 66. 

5. 27, 29, 31, 37, and 41. 

6. Assume a \ b and a \ c. There exist integers x and y such that b = ax and c = ay. Then 

be = ax ■ ay = a 2 ■ xy , 
where xy is an integer. Thus, a 2 | be. 
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Section 5.4 

1. The only common divisors of 3 and 5 are ±1. Hence, gcd(3, 5) = 1. 

2. The largest positive divisor of —8 is 8, which also divides 0. Thus, gcd(0, —8) = 8. 


By applying the theorem repeatedly, we 

have 


732 = 

153-4+ 120, 

gcd(732, 153) 

= gcd(153, 120) 

153 = 

120 • 1 + 33, 

gcd(153, 120) 

= gcd(120, 33) 

120 = 

33-3 + 21, 

gcd(120, 33) 

= gcd(33, 21) 

33 = 

21 • 1 + 12, 

gcd(33, 21) 

= gcd(21, 12) 

21 = 

12-1 + 9, 

gcd(21, 12) 

= gcd(12, 9) 

12 = 

9-1 + 3, 

gcd(12, 9) 

= gcd(9, 3) 

9 = 

3-3 + 0, 

gcd(9, 3) 

= gcd(3, 0) = 3. 

Therefore, gcd(732, 153) = 3. 



By applying division 

repeatedly, we find 




6958 

= 2478 • 2 + 2002 

gcd(6958, 2478) 

= gcd(2478, 2002), 

2478 

= 2002 -1 + 476 

gcd(2478, 2002) 

= gcd(2002, 476), 

2002 

= 476-4 + 98 

gcd(2002, 476) 

= gcd(476, 98), 

476 

= 98-4 + 84 

gcd(476, 98) 

= gcd(98, 84), 

98 

= 84-1 + 14 

gcd(98, 84) 

= gcd(84, 14), 

84 

= 14-6 + 0 

gcd(84, 14) 

= gcd(14,0) = 14. 


Therefore, gcd(6958, 2478) = 14. 

5. We find gcd(732, 153) = 3, as follows: 


4 

732 

153 

1 

612 

120 

3 

120 

33 

1 

99 

21 

1 

21 

12 

1 

12 

9 

3 

9 

3 



9 



0 


6. We find gcd(6958, 2478) = 14, as follows: 


2 

6958 

2478 


4956 

2002 

4 

2002 

476 


1904 

392 

1 

98 

84 


84 

84 


14 

0 


7. From the linear combinations 

7(5m + 7n) — 5(7m + 5n) = 24n, 
— 5(5m + 7n) + 7(7m + 5n) = 24m, 
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we know that gcd(5m + 7 n, 7 m + 5 n) divides both 24n and 24m. Since gcd(m, n) = 1, we 
conclude that gcd(5?n + 7 n, 7m + 5 n) divides 24. Thus, gcd(5?n + 7 n, 7m + 5 n) equals to 
1, 2, 3, 4, 6, 8, 12, or 24. 

8. The following computation 


Sk 

tk 

qk 



0 

1 




1 

0 

4 

732 

153 

-4 

1 

1 

612 

120 

5 

-1 

3 

120 

33 

-19 

4 

1 

99 

21 

24 

-5 

1 

21 

12 

-43 

9 

1 

12 

9 

67 

-14 

3 

9 

3 




9 





0 



1 

1 

1 


shows that 3 = gcd(153, 732) = 67 • 153 - 14 • 732. 


9. The following computation 


Sk 

tk 

Qk 



0 

1 




1 

0 

2 

6958 

2478 

-2 

1 

1 

4956 

2002 

3 

-1 

4 

2002 

476 

-14 

5 

4 

1904 

392 

59 

-21 

1 

98 

84 

-73 

26 

6 

84 

84 




14 

0 


shows that 14 = gcd(2478, 6958) = -73 • 2478 + 26 • 6958. 


Section 5.5 


1. -43 -133 + 40 -143 = 1. 

2. -512 -757 + 319 -1215 = 1. 

3. Suppose y/7 is rational, then we can write 

V7=- 

n 

for some positive integers m and n that do not share any common divisor except 1. Squar- 
ing both sides and cross-multiplying gives 

n 2 _„2 

7 n = m . 


Thus 7 divides m 2 . Since 7 is prime, Euclid’s lemma asserts that 7 must also divide m. 
Then we can write m = 7s for some integer s. The equation above becomes 

7 n 2 = m 2 = (7 q) 2 = 49g 2 . 


Hence, 


= 7 q 2 


which implies that 7 divides n 2 . Again, since 7 is prime, Euclid’s lemma implies that 7 
also divides n. We have proved that both m and n are divisible by 7. This contradicts 
the assumption that m and n do not share any common divisor. Therefore, \J 7 must be 
irrational. 
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Section 5.6 

1. Since 153 = 3-3-17, and 72 = 2 • 2 • 3 • 61, we determine that gcd(153, 72) = 3. 

2. By writing the factorizations as 

2 3 • 5 • 7 • ll 2 = 2 3 • 3° • 5 1 • 7 1 • ll 2 , 

2 2 • 3 2 • 5 2 • 7 2 = 2 2 • 3 2 • 5 2 • 7 2 • 11°, 


it becomes clear that gcd(2 3 • 5 • 7 • ll 2 , 2 2 • 3 2 • 5 2 • 7 2 ) = 2 2 • 3° • 5 1 • 7 1 • 11° = 4 • 5- = 140. 

3. We find lcm (2 3 • 5 • 7 • ll 2 , 2 2 • 3 2 • 5 2 • 7 2 ) = 2 3 • 3 2 • 5 2 • 7 2 • ll 2 = 10672200. 

4. We find lcm(246, 426) = ™ ' 426 = 246 ' 426 = 17466 . 

v ; gcd(246, 426) 6 

5. Since lcm(35,42) = 210, the two comets will return to Earth together in 2222. 

6. From the linear combinations 

6(4?n — 6 n) — 4(6 m + 4 n) = —52 n, 

4(4m — 6n) + 6(6?n + 4n) = 52 to, 

we know that gcd(4m — 6n, 6m + 4n) divides both — 52n and 52 m. Since gcd(m, n) = 1, 
we conclude that gcd(4m — 6n, 6m + 4n) divides 52. Consequently, gcd(4m — 6n, 6m + 4n) 
equals to 1, 2, 4, 13, 26, or 52. It follows that lcm(4m — 6n,6m + 4n) equals to mn, mn/2, 
mn/4:, mn/ 13, mn/ 26, or mn/52. 

7. Assume x £ 4Z (~l 6Z, then x £ 4Z and x £ 6Z. This means a: is a multple of both 4 
and 6. Consequently, a; is a multiple of lcm(4, 6) = 12, which means x £ 12Z. Thus, 
4Z n 6Z C 12Z. 

Next, assume x £ 12Z, then a: is a multiple of 12. Consequently, a: is a multiple of 3, as 
well as a multiple of 4. This means x £ 4Z, and x £ 6Z. As a result, x £ 4Z (~l 6Z. Thus, 
12Z C 4Z (~l 6Z. Together with 4Z f~l 6Z C 12Z, we conclude that 4Z (~l 6Z = 12Z. 

Section 5.7 

1. Wednesday. 

2. 13. 

3. 3. 

4. 8. 

5. There are five cases to consider: 


n (mod 5) 

n 5 — n (mod 5) 

0 

O 

II 

O 

1 

lO 

O 

1 

l 5 - 1 = 0 

2 

2 5 - 2 = 30 = 0 

3 

3 5 — 3 = 340 = 0 

4 

4 5 — 4 = 1020 = 0 


Therefore, for any integer n, we always have n 5 — n = 0 (mod 5), which means 5 | (n 5 — n). 

6. 7 45 = 7 32 • 7 8 • 7 4 • 7 = 5 • 9 • 3 • 7 = 10 (mod 11). 

7 . 9 58 = 9 32 • 9 16 • 9 8 • 9 2 = 18 • 8 • 13 • 12 = 16 (mod 23). 

8. 17. 


9. 38. 
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Section 6.1 

1. The domain is R, and the co domain is Z. 

2. The range is R + U {0}. Hence, the square root function is not onto. 

3. No, because R + is a set, and 0 is a number. We can only take union of two sets. 

Section 6.2 

1. Only / is a well-defined function. The rule for g does not assign any value to a. In other 
words, g(a ) is undefined. For h, two values are associated to b. That is, there are two 
possible values for the image h{b), which is not allowed. 

2. No, r is not a well-defined function because the value of r(x) should be the same regardless 
of which day of the week it is. 

3. No, s is not a well-defined function because the images s(x) are undefined for 2 < x < 3. 

4. We also have 

n({a, &}) = n({a, d}) = n({fo, c}) = rc({c, d}) = 2. 

The value of n(S) must be between 0 and 4, inclusive. 

5. 


0 12 3 4 

0 / 1 0 0 0 0 \ 

1 0 0 0 1 0 

2 0 1 0 0 0 

3 0 0 0 0 1 

4 0 0 1 0 0 

5 1 0 0 0 0 

6 0 0 0 1 0 

7 0 1 0 0 0 

8 0 0 0 0 1 

9 \ 0 0 1 0 0 / 




Section 6.3 

1. Assume g(x i) = g(x 2 ), then 

5 — 7aq = 5 — 7x2, 
which clearly implies X\ = X2- Hence, g is one-to-one. 

2. Assume h(x 1 ) = h(x 2 ), then 



Squaring both sides yields aq — 2 = X 2 — 2, which clearly implies X\ = aq. Hence, h is 
one-to-one. 

3. Assume k( aq) = ^(aq), then 

lnaq = lnaq. 

Raising both sides to the power of e, we find 

glnxi _ gin 2:2 

which simplifies to aq = aq. Thus, k is a one-to-one function. 
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Alternatively, we can look at the derivative k'(x) = 1/x. Since k'(x) > 0 for all x > 0, the 
function k is increasing. Thus, k is one-to-one. 

4. First, we use the two-point form to find the equation of the line: 

y- 2 5-2 3 

a; — 3 1 — 3 2' 

This simplifies to y = — |x + 4^. However, this is not the correct answer. We want a 
function, so the answer should be 

/: [1,3] — t [2,5], f(x) = ~^x+ y. 

5. There are many possible answers, we only give two here, their graphs are the straight lines 
that join the opposite corners of the rectangle framed by the domain and codomain. The 
straight line joining the two corners (3, 2) and (8, 5) yields the example 

/: [3,8] -A [2,5], /(x) = |x+^, 

and the line joining the two corners (3,5) and (8,2) gives the example 

3 34 

9- [3,8] -> [2,5], g(x) = ~-x + 

6. Assume h(x 1) = h(x 2), then 

Ax\ — 11 = 4^2 — 11 (mod 15). 

Adding 11 to both sides yields 

4a: 1 = 4x2 (mod 15). 

Multiplying 4 to both sides leads to 

16x’i = 16x2 (mod 15), 

which simplifies to x\ = X2 (mod 15). Therefore, h is one-to-one. 

7. We find, for example, k( 3) = k{ 6) = 4 (mod 15). Hence, k is not one-to-one. 

8. Assume h(n\) = h(n 2 )- Since the image is either odd or even, we have to consider two 
cases. 

• If both h(n 1) and h(n 2 ) are odd, then 

2ni + 1 = 2n 2 + 1, 

hence, ni = n 2. 

• If both h(ni) and h(n 2) are even, then 

— 2ni = — 2ri2, 

hence, n-[ = n 2. 

In both cases, we Hnd n\ = ri2- Therefore, h is one-to-one. 
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Section 6.4 

1. We can use, for example, a straight line graph that connects the points (1,2) and (3,5). 
This leads to the function /: [1, 3] — > [2, 5] defined by f(x) = \ x + 

2. We can use, for example, a straight line graph that connects the points (1,2) and (3,4), 
see Figure 6.4. 

3. [0, oo). 

4. The midpoint of the interval (2,9) 
x — 4r shifts the interval to (— |, \ 

5. Let y = 3a: + 11, then 


which is, of course, an element of the domain. Hence, g is onto. 

6. It is obvious that the graphs y = 3x + 1 and y = Ax are increasing. For x < 2, the y - values 
cover the range (— oo,7). For x > 2, the y - values cover the range (8, oo). Hence, the 
y-v alues in the interval [7, 8] are never used as images. For example, there is no x-value 
which would give f(x) = Therefore, / is not onto. 

7. Let y = 5x + 8 (mod 23). Then 

5x = y — 8 (mod 23) . 

Since 5” 1 = 14 (mod 23), we find 

x = 5 ~ 1 (y — 8) = lA(y — 8) (mod 23). 

Therefore, h is onto. 

8. No! Since v(n) > 2, there does not exist n G N such that v(n) = 1. 

9. No! Someone in the tree would have no daughter. She could be you, or your sisters, or 
one of their infant daughters, or someone higher up in the tree. For this individual y, we 
cannot find any x such that h\{x) = y, because this would make x a daughter of x. 


is and its width is 7. The transformation of x to 
). Hence, we can set h(x) — tan [y (x — -g-)] . 


Section 6.5 


1. (ll,oo); R. 


x — x — 7 = 1 x — > — - 


29 


29 


2. Since 


we determine that img = ( — 00 ). 

3. Remember that h({ 0, 3, 4}) is a set, so we need to use a set notation. The answer is {4, 9}. 


4. {0,1, 2, 3, 4, 5}. 

5. Let y € f{C\ n C 2 ), we want to show that y € f(Ci) PI /(C 2 ). Having y G f(C\ fl C 2 ) 
means there exists x € C\ fl C 2 such that f(x) = y. Now that x € C\ fl C 2 requires x € C\ 
and x & C' 2 - 


• For x € Ci, we find y = f(x) G f(C\)- 

• For x G C 2 , we find y — /( x) G /(C 2 ). 
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We conclude that y = f{x) belongs to both f(Ci) and f(C 2 ). Thus, f(x) G f{C\) 0 f(C 2 ), 
proving that f{C\ O C 2 ) Q f(Ci) (~l f(C 2 ). 

6 . We want to find x such that 

x 2 — x — 7 = 3. 

This is equivalent to solving the equation 

x 2 — x — 10 = 0. 

Its solutions are x = lj= £ Q. Therefore, fc _ 1 ({3}) = 0. 

7. h-'m) = {0, 3, 6 , 9, 12}; h~\{ 2}) = 0. 

8 . First, we want to prove that f~ 1 (D\ PiD 2 ) C / _ 1 (I? 1 ) nf~ 1 (D 2 ). Let x € / - 1 (Z?i C\D 2 ), 
then f(x) G Di n D 2 . This means either f(x) € D x and f(x) G D 2 . 

• For f(x) G Di, we find x G 

• For f(x) G D 2 , we find x G f~ 1 (D 2 ). 

Since x belongs to both f~ l {Di) and f~ 1 (D 2 ), we determine that x G f~ 1 (Di)nf~ 1 (D 2 ). 
Therefore, f~ 1 {D l D D 2 ) C n f~\D 2 ). 

Next, we want to prove that / - 1 (Z?i) D f~ 1 (D- 2 ) C / _1 (D 1 n D 2 ). Let x G n 

f~ 1 (D 2 ). Then x belongs to both and x G f~ 1 (D 2 ). 

• For x G f~ 1 (D 1 ), we find f(x) G D\. 

• For x G f~ 1 (D 2 ) , we find /( x) G D 2 . 

Hence, f(x) belongs to both D\ and D- 2 , which means f(x) G D\ fl D 2 . Thus, x G 
f~ 1 (Di n D 2 ). We have proved that /^ 1 (Di) fl /~ 1 (D 2 ) C / _ 1 (D 1 n D 2 ). Together with 
/ _ 1 (DinD 2 ) C f- 1 {D 1 )nf~ 1 (D 2 ), we conclude that f~ 1 {D 1 r\D 2 ) = f- 1 (D 1 )r\f~ 1 {D 2 ). 


Section 6.6 

1. ,f~ 1 :[ 0, 00 ) -A [-3,oo), f~ l {x) = x 2 - 3. 

2 . g- 1 : ( 0 , 00 ) -» K, 9 ~ 1 {x) = ln;r. 


3. Following the same idea used in Example 6.6.3, we find 


g g x (x) 


| (x — 5) if x < 23, 
| (x + 7) if x > 23. 


4. Let y = 49a; — 3 (mod 57). Interchanging x and y yields 

x = 49 y — 3 (mod 57). 


Hence, 

y = 49^ 1 (a: + 3) = 7(x + 3) (mod 57). 
Therefore, h~ 1 :Z 5 7 -A- Z57 is defined by h~ 1 (x) = 7(x + 3) mod 57. 

5. Following the same idea used in Example 6 . 6 . 6 , we find 


r 1 : N^z, / _ 1 (n) 


n 

2 

n— 1 
2 


6 . ( 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 ); { 01 , 03 , 04 , 05 }. 


if n is even, 
if n is odd. 
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Section 6.7 

1. We find p o q: R. — » R and q o p: R — » R defined by (p o q)(x) = 2a: 2 + 7, and (g o p)(x) = 
4x 2 + 20x + 26. 

2. Direct computation yields 

f(g( x)) = 7g(x) + 2 = 7(5a: — 3) + 2 = 11a: + 5 (mod 12). 

Hence, fog : Zi 2 — > Zi 2 is defined by (/ o g)(x) = 11a: + 5 (mod 12). 

3. Since (/ o g)(x) = f(g(x)) = 3g(x) + 2, the function / o g: R — » R is defined by 

( f°9){x ) = 

4. The composite function / o g: Z — ► Z is defined by 

( /o 9 )(n) = | n + 2 ifniso<Jd 

This is how we obtained the answer. If n is even, then /(n) = n + 1 is odd, so we have to 
use the second branch in g to evaluate g(f(n)). We find, for even n, g(f(n)) — g{n + 1) = 
(n + 1) — 7 = n — 6. In a similar manner, when n is odd, f(n) = n — 1 is even, therefore 
=g(n-l) = (n-l) + 3 = n + 2. 

5. The composite function h o g is easy to obtain: 

ftoy:Z->R, (hog)(x) = (\/\x\ - 5) 2 . 

To compute g o ft,, we start with ft, whose codomain is R. This means the result from ft 
could be a real number. But the domain of g is Z, therefore g o ft is not a well-defined 
composite function. 

6. We find 


( 3x 2 + 2 if x < 5 

\ 3(2x — 1) + 2 if a; >5 

f 3a; 2 + 2 if x < 5 
\ 6x — 1 if x > 5 


(/° g)(x) = f(g(x)) = e^=e lnx = x, 

(S°/)W = 9(f(x)) = ln/(x) = In e x = x. 

Therefore, / and g are inverse functions of each other. 

Section 7.1 

1. False, false, true, true, true. 

2. Yes, we can write either (2,0.5) £ G, or 2G0.5. 

No, (4, 0.5) ^ G, which can also be written as 4 ^0.5. 

No, (10, 3) ^ G, and we can also write 10 (£3. 

3. No, (0,3) i G, or 0 j£3. No, (1,-1) £ G, or 1 Q - 1. Yes, (^=, y/2) £ G, or -j -Gy/2. 

4 . S = {(2, 2), (2, 4), (2, 6), (2, 8), (2, 10), (2, 12), (3, 3), (3, 6), (3, 9), (3, 12), 

(4,4), (4, 8), (4, 12)}. 

5. domS'= {2,3,4} = S - {7}, imS = {2, 3, 4, 6, 8, 9, 10, 12} = S - {1, 5, 7, 11}. 
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2 

3 

4 
7 


12 3 4 

/ 0 1 0 1 
0 0 10 
0 0 0 1 
\ 0 0 0 0 


5 6 7 8 9 

0 10 10 
0 10 0 1 
0 0 0 1 0 

0 0 0 0 0 


10 11 12 

1 0 1 \ 

0 0 1 

0 0 1 

0 0 0 / 


7. MATH CSIT MATH MATH CSIT 
210 121 223 231 120 



John Mary Paul Sally 



O 

t-H 

E j 

co 

Cl 

t-H 

co 

o 


cu 

CM 

Cl 

Cl 

CM 


tn 

i-H 

K 

w 

i—l 


EH 

EH 

EH 

EH 

EH 


<1 

1— 1 
cn 



1— 1 

CD 


2 

o 

S 

§ 

o 

John 

( 1 

l 

1 

0 

0 ' 

Mary 

1 

l 

0 

1 

0 

Paul 

0 

0 

1 

1 

1 

Sally 

l 1 

0 

0 

0 

1 


Section 7.2 

1. We find the following. 

• The relation R is reflexive since a < a for all a; hence, it cannot be irreflexive. 

• It is not symmetric, because 2 < 3 but 3^2. 

• Since a < b and b < a does imply that a = 6, the relationi? is antisymmetric. 

• Finally, it is transitive because a < b and b < c imply that a < c. 

The relation R is reflexive, antisymmetric, and transitive. 

2. This is how the analysis may proceed: 

• Since a ■ a = a 2 is always positive for any a € R* (question: is it still true if A = R?), 
the relation S is reflexive, hence, it is not irreflexive. 

• Since ab = ba, it follows that whenever ab > 0, we also have ba > 0. Therefore, S is 
symmetric. 

• However, S is not antisymmetric. For example, 2 • 3 > 0 and 3 • 2 > 0, but 2^3. 

• Since ab > 0 only when a and b have the same sign, if we also have be > 0, then c 
must have the same sign as 6, hence, the same sign as a, which in turn implies that 
ac > 0. Thus, S is transitive. 

The given relation is reflexive, symmetric, and transitive. 

3. The is how the analysis goes: 

• The relation T is reflexive because a \ a for any positive integer a. Consequently, T 
is not irreflexive. 

• Since 2 | 6 but 6 j 2, we find T non-symmetric. 

• However, if a \ b and b | a, we must have a = b, thus T is antisymmetric. 

• Finally, a \ b and b \ c do imply that a \ c, hence, T is transitive. 

The relation is reflexive, antisymmetric, and transitive. 

4. The argument is similar to Hands-On Exercise 7.2.3, the relation is reflexive and transitive. 

5. The argument is similar to Hands-On Exercise 7.2.3, the relation R is reflexive, antisym- 
metric, and transitive. 


6. We obtain these conclusions: 
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• Anyone and himself (or herself) must have the same last name, hence, W is reflexive, 
which immediately implies that W cannot be irreflexive. 

• If two people a and b have the same last name, then so are b and a. Thus, W is 
symmetric. 

• It is not antisymmetric. For example, John Doe and Jane Doe are two different 
persons having the same last name. 

• It is obvious that W is transitive. 

This relation is reflexive, symmetric, and transitive. 

Section 7.3 

1. The proof is similar to Example 7.3.2. 

2. The proof is similar to Example 7.3.2. 

3. [0], [1], [2], [3], [4], [5]. 

4. [0], [1], [2], . . . , [n — 1]. 

5. Example 7.2.5: [1] = Q, and [a:] = a;Q, where x is any irrational number. 

Example 7.2.7: [0] and [1]. 

Hands-On Exercise 7.2.2: [1] = M + , and [—1] = R - . 

Hands-On Exercise 7.2.6: each equivalence class consists of individuals with the same last 
name. 

6. Since x — x = 0 is an integer, we find x ~ x, hence, ~ is reflexive. If x ~ y, then x — y = m 
for some integer to. It follows that y — x = —(x ~ y) = —to, where —to. is an integer. 
Hence, y ~ x as well, which means ~ is symmetric. If x ~ y and y ~ z, then x — y = m 
and y — z = n for some integers to and n. Then 

x — z = (x — y) + (y — z) = to. + n 

is an integer. Hence, x ~ z, which means ~ is transitive. Therefore, ~ is an equivalence 
relation. 

7. The proof is identical to that of Hands-On Exercise 7.3.6. However, —2.14 ^ [5.14], because 
5.14- (-2.14) = 7.28 (£ Z. 

8. Two points are related if they both lie on the same line y = 5x + b for some specific b. 
To obtain a more precise formulation, let (*i,j/i) and (£ 2 , 2 / 2 ) be the two points. Then 
yi = 5a; 1 + b and y -2 = 5^2 + b. Since b is fixed, we find b = yi — 5x 1 = y -2 — 5x2- Therefore 

(* 1 , 2 / 1 ) ~ (* 2 , 1 / 2 ) ^ 2/1 - 5*i = 2/2 5x 2 

is the relation induced by the partition. 

9. From the incidence matrix 



1 

2 

3 

4 

5 

6 



1 

4 

2 

5 

6 

3 

1 

( 1 
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0 

1 

0 

0 ^ 


1 

f 1 

1 
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0 

0 

0 \ 

2 

0 

1 

0 

0 

1 

1 
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1 

1 

0 

0 

0 

0 

3 

0 

0 

1 

0 

0 

0 


2 

0 

0 

1 

1 

1 

0 

4 

1 

0 

0 

1 

0 

0 


5 

0 

0 

1 

1 

1 

0 

5 

0 

1 

0 

0 

1 

1 


6 

0 

0 

1 

1 

1 

0 

6 

l 0 

1 

0 

0 

1 

1 ) 


2 

V 0 

0 

0 

0 

0 

1 / 


it is clear that the relation S is an equivalence relation, and its equivalence classes are 
[1] = {1,4}, [2] = {2, 5, 6}, and [3] = {3}. 
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10. {(a, a), (a,d), (b,b),(b,c),(b,g), (c,6), (c,c), (c,#), (d,a), (d,d), 

(e, e), (e, /), (/, e), (/, /), (5, b), (g, c), (5,3)} 

Section 7.4 

1. The “divides” relation is reflexive and transitive over Z*, but it is not antisymmetric. For 
example, (—2) | 2, and 2 | (—2); yet —2 ^ 2. 

2. The relation C is reflexive and transitive, but not antisymmetric. For example, {a, b} C 
{fr}, and {6} C {a,&}; yet {a,b} ± {fe}. 

3. The Hasse diagram is displayed below, on the left. 



16 


4 


2 


1 


4. The Hasse diagram is displayed above, on the right. 

5. The Hasse diagram for the poset (p({a, 6, c}), C) is shown below. 


{a, b, c} 




{a,&} {a, c} {b, c} 




{«} {b} {c} 




0 


Section 8.2 

1. 7, 11. 



Appendix A Solutions to Hands-On Exercises 


275 


2. Let A and B denote the sets of people who attended the two games, then we are looking 
for the value of \A U B\. Since \A\ = 72397, \B\ = 69211 and \A 0 B\ = 45713, we find 

\AUB\ = \A\ + \B\ - | A n B\ = 72397 + 69211 - 45713 = 95895. 

Conclusion: 95895 different people attended the two games. 

3. This time, we have |A| = 72397, \B\ = 69211 and \A U B\ = 93478, hence, 

\AnB\ = \A\ + \B\ -\AUB\ = 72397 + 69211 - 93478 = 48130. 

Conclusion: 48130 people attended both games. 

4. The answer is (47 + 43 + 32) - (33 + 27 + 25) - 22 = 59. 

5. 100000. 

6. 999. 

7. 9-9-8-7-6-5 = 136080. 

8. There are 9 • 10 • 10 = 900 natural numbers with 3 digits. Among them there are 10 of 
the form 44a;, and 9 of the form a;44, for some digit x. However, 444 is counted in both 
groups, so there are actually 10 + 9—1 = 18 integers with repeated 4s. Hence, the number 
of 3-digit natural numbers that do not have repeated 4s is 900 — 18 = 882. 

9. (a) There are 26 choices each for the first two letters, and 10 choices for each of the 
remaining four digits. Hence, there are 26 2 • 10 4 choices for the PINs. 

(b) There are seven cases: from 0 to 6 digits following the first two letters, thus the total 
count is 26 2 (1 + 10 + 10 2 + • • • + 10 6 ) = 26 2 (10 7 - l)/9. 

(c) Similar to (b), the total number of PINS equals to 26 2 (10 2 + 10 3 + • • • + 10 6 ) = 
26 2 • 10 2 (1 + 10 + • • • + 10 4 ) = 260 2 (10 5 - l)/9. 

Section 8.3 

1. 21 • 20 • 19 • 18 = 143640. 

2. 6 • 5 • 4 • 3 = P(6, 4) = 360. 

3. 15 s ; 15 • 14 • 13 • 12 • 11 = P(15, 5) = 360360. 

4. 7! = 5040. 

5. 71/2 = 2520. 

Section 8.4 

L ( 4 3 2 ) = = 2-11.10 = 220. 

2. The order in which the committee members are selected does not matter. This problem 
essentially counts the number of 3-element subsets. The answer is ( 7 ) . 

3. There are ( 23 ) subsets with 5 elements. 

4. ( 525 ) = ( 5 f) = 529 ' 5 ll 2 % 7,526 = 3226076876. 

5- (S). 

6. P(10, 4). 
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8. A bridge hand is a 13-element subset, so it is a combination problem. The four spades can 
be chosen in ( 4 3 ) ways. The remaining nine cards must be selected from the remaining 39 
cards (the non-spades), hence, they can be chosen in ( 3 g 9 ) ways. Together, we determine 
that the number of bridge hands with exactly four spades is ( 1 4 3 ) ( 39 ) . Note that the upper- 
numbers add up to 52, the total number of cards available, and the lower numbers add up 
to 13, the total number of cards selected. 


9. The spades can be selected in ( 4 3 ) ways, and the hearts in ( 4 3 ) ways. The remaining five 
cards must be selected from the remaining 26 cards other than spades and hearts, they 
can be selected in ( 2 ®) ways. Hence, there are ( 4 3 ) ( 4 ) ( 2 5 6 ). Again, take note that the 
upper numbers add up to 52, and the lower numbers add up to 13. 


10. Following the same approach in the last two hands-on exercises, we find the total number 
to be ( 4 3 ) (g 3 ) (g 3 ) (g 3 ), which can be written as ( 4 3 ) (g 3 ) . 

11. There are ( 3 ) ways to choose the two blue balls. The other three balls must be either 
red or green, so we have to choose 3 balls from 6 + 5 = 11 balls. There are (g 1 ) choices. 
Together, there are ( g ) (g 1 ) selections with exactly two blue balls. 

12. This is a combination problem, because we are selecting soda cans without worrying about 
the order of selection. 

(a) The 4 cans of Pepsi can be selected in ( 4 ) ways. The other 6 cans can be selected from 
the remaining 16 cans, and there are (g) ways to do so. The total number of selections 
is therefore ( 4 ) (g 6 ) . Note that the upper numbers add up to 24, the total number of soda 
cans, and the lower numbers add up to 10, the number of cans selected. 

(b) “At least 4 cans of Pepsi” means we can choose from 4 to 8 cans. Following the 
argument used above, the number of selections is 



which can be written as X]fe =4 (D (io 1 -*)- 

(c) This time, the number of Pepsi is between 0 to 4 cans. The number of selections is 



or simply £t=o (|) (io-fc) - 

(d) This is a more elaborate version of the previous problems. The 3 Pepsi cans can be 
selected in (®) . Now we have to pick the other 7 cans. The number of Sprite could vary 
from 0 to 3 cans. Once we have picked the Sprite cans, the other cans must be selected 
from the remaining 9 cans of Dr. Pepper and Mountain Dew. Thus, the total count is 



Using the sigma notation, we can write ( 3 ) J2k=o ( I ) (r-k) • 


Section 8.5 

1. 81a: 4 - 540 x 3 y + 1350 x 2 y 2 - 1500a :y 3 + 625 y 4 . 

2. Since (1 + 3 t) 8 = J2t = o (fc)(^) fe ; we nee d k = 5. The coefficient is (®) 3 5 . 
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3. Since (2 — 5 1) 9 = ^2k=o (D^ 9 fc (— 5 t) k , we need k = 4, which implies that the coefficient 
of f 4 is Q 2 5 • (— 5) 4 . 

4. Since (3 - 2f 3 ) 8 = ^Lo (. l)3 8 ~ k (-2t 3 ) k = £)j*= 0 (l)3 8 ~ k {-2 ) k t 3k , we need 3fc = 9, or 
k = 3. The coefficient is (®)3 5 (— 2) 3 . 


5. Since 


4 =e 4 -e 


2 \ 


fc =0 x x ' fc =0 

we need 9 — 2 k = 0, which has no integral solutions. Hence, the constant term in ( x+2>/x)~ 
is zero. 


For the second problem, since 


2x — 


10 10 


= £(IV> 


\ 10— fc 


10 


= £ 


10 


10-fc/ n\k„W-2k 


= (-3) 


x 


k = 0 x x ' fc=0 

we need 10 — 2k = 0 or k = 5. The constant term is ( 1 5 °)2 5 (— 3) 5 . 

6. (g 2 ) • 2® - 2(1?) • 2 7 + 3(g 2 ) • 266. 

7. The 7tli, 8th, and 9th rows of Pascal’s triangle are displayed below. 


1 7 21 35 35 21 7 1 

1 8 28 56 70 56 28 8 1 

1 9 36 84 126 126 84 36 9 1 



278 


Appendix A Solutions to Hands-On Exercises 



Appendix B 


Answers to Selected Exercises 

Section 2.1 

1. Only (a), (c), and (e) are statements. 

3. (a) false (b) false (c) false (d) true 

5. (a) 7r ^ Z (b) l 3 + 2 3 + 3 3 ^ 3 2 • 4 2 /4 (c) u is not a vowel 

(d) This statement is either true or false. 

7. (a) true (b) true (c) true (d) false (e) false (f) true 

9. By definition, a rational number can be written as a ratio of two integers. After multiplying 
the numerator by 7, we still have a ratio of two integers. Conversely, given any rational 
number x, we can multiply the denominator by 7, we obtain another rational number y 
such that 7 y = x. Hence, the two sets 7Q and Q contain the same collection of rational 
numbers. In contrast, OQ contains only one number, namely, 0. Therefore, 0Q ^ Q. 

Section 2.2 

1. (a) p A q (b) gAr (c)pVg (d) (pV q) Ap A q 

3. (a) p A q; always false regardless of the value of r. 

(b) pVg; always true regardless of the value of r. 

(c) {p A q) V r; true if r is true, and false if r is false. 

(d) gAr; true if r is true, and false if r is false. 

5. (a) false (b) true 

7. (a) (4 < x) A (x < 7) (b) (4 < a:) A (a: < 7) (c) (4 < x) A (x < 7) 

Section 2.3 

1. (a) p => q (b) r => p (c ) p => q (d) p => r (e) (p A q) =>• r 
3. (a) p => q, which is false. 

(b) p => r, which is true if r is true, and is false if r is false. 

(c) (p V q) => r, which is true if r is true, and is false if r is false. 

5. (a) x 3 — 3x 2 + x — 3 = 0=>a: = 3 

(b) x 3 — 3x 2 + x — 3 = 0=>a; = 3 

(c) x = 3 => x 3 — 3x 2 + x — 3 = 0 
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p 

q 

r 

pAq 

(p A q) V r 


P 

q 

r 

PV q 

p A r 

(p V g) => (p A r) 

T 

T 

T 

T 

T 


T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

T 


T 

T 

F 

T 

F 

F 

T 

F 

T 

F 

T 


T 

F 

T 

T 

T 

T 

T 

F 

F 

F 

F 


T 

F 

F 

T 

F 

F 

F 

T 

T 

F 

T 


F 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 


F 

T 

F 

T 

F 

F 

F 

F 

T 

F 

T 


F 

F 

T 

F 

F 

T 

F 

F 

F 

F 

F 


F 

F 

F 

F 

F 

T 


9. (a) Using a truth table, we find that the implication (pAq) 
no truth value of p would make (p A q) => (q V r) false. 


(gVr) is always true. Hence, 


(b) From a truth table, we find that, (q A r) => (pA g) is false only when p is false. We 
can draw the same conclusion without using any truth table. An implication is false only 
when its hypothesis (in this case, q A r) is true and its conclusion (in this case, p A q) is 
false. For q A r to be true, we need both q and r to be true. Now q is true and p A q is 
false require p to be false. 


Section 2.4 

1. (a ) p <t=> g (b)r<t=>p (c)r<t=>(gAp) (d) r <t=> (p A g) 

3. (a) p <t=> q, which is false. 

(b) p <t=> r, which is true if r is true, and is false if r is false. 

(c) (p V g) <t=> r, which is true if r is true, and is false if r is false. 

5. (a) true (b) false (c) false (d) false 

7. We say n is odd if and only if n = 2g + 1 for some integer q. 


Section 2.5 


1 . 


P 

q 

pVg 

P v g 

P 

q 

PAq 

T 

T 

T 

F 

F 

F 

F 

T 

F 

T 

F 

F 

T 

F 

T 

T 

T 

F 

T 

F 

F 

T 

F 

F 

T 

T 

T 

T 


3. Only (b) is a tautology, as indicated in the truth tables below. 


P 

q 

P 

pVg 

{P V g) => p 

T 

T 

F 

T 

T 

T 

F 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

T 

F 


(b) 


P 

q 

p => q 

q 

p=> q 

(p => q) V (p => g) 

T 

T 

T 

F 

F 

T 

T 

F 

F 

T 

T 

T 

F 

T 

T 

F 

T 

T 

F 

F 

F 

T 

T 

T 
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p 

q 

r 

p=> q 

(p=> q)=>r 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

F 

T 

F 

T 

T 

F 

F 

F 

T 

F 

T 

T 

T 

T 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

F 

F 

F 

T 

F 


5. The proofs are displayed below without explanations. Be sure to fill them in. 


(b) (pAg)=tr e pAgVr ( ) 

= (pVg)Vr ( ) 

= pV(gVr) ( ) 

= P=t(gVr) ( ) 


(c) (p^ ? )A(p=tr) e (pVg)A(pVr) ( ) 

= p V (q A r) ( ) 

= pV q V r ( ) 

= pA(gVr) ( ) 


7. (a) Converse: 
Inverse: 
Contrapositive: 


If triangle ABC is a right triangle, then ABC is isosceles 
and contains an angle of 45 degrees. 

If triangle ABC is not isosceles or does not contain an angle 
of 45 degrees, then ABC is not a right triangle. 

If triangle ABC is not a right triangle, then ABC is not isosceles 
or does not contain an angle of 45 degrees. 


(b) Converse: 
Inverse: 

Contrapositive: 


If quadrilateral ABCD is both a rectangle and a rhombus, 
then ABCD is a square. 

If quadrilateral ABCD is not a square, 

then it is not a rectangle or not a rhombus. 

If quadrilateral ABCD is not a rectangle or not a rhombus, 
then ABCD is not a square. 


9. (a) true (b) true 


(c) false 


11. Only (b). 

13. (&)pAq (b) p A g (c)pAg 


Section 2.6 

1. (a) There exists an integer n such that n is prime and n is even. 

(b) For all integers n, if n > 2, then n is prime or n is even. 

(c) There exists an integer n such that n is prime, and either n is even or n > 2. 

(d) For all integers ?r, if n is prime and n is even, then n < 2. 

3. (a) true (b) true (c) false (d) false (e) true 

5. (a) 3a; < 0 3 y, z £ (y < z A xy < xz) 

(b) 3a; € Z [p(a:) A q(x)] 

(c) 3a;, y £ R. \p(x, y) A q{ x, y )] 
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7. (a) Var, y&M.(x + y = y + x) 

3a;, y € R (x + y ^ y + x) 

There exist real numbers x and y such that x + y ^ y + x. 

(b) Mx e R + By € R. (y 2 = x) 

3a; e R + \/y € R. {y 2 ^ x) 

There exists a positive real number x such that for all real numbers y, y 2 x. 

(c) By € RVx € Z (2x 2 + 1 > x 2 y) 

\/y € RBx € Z (2x 2 + 1 < x 2 y) 

For every real number y , there exists an integer x such that 2a; 2 + 1 < x 2 y. 

9. The statement “a square must be a parallelogram” means, symbolically, 

MPQRS ( PQRS is a square => PQRS is a parallelogram) , 
but the statement “a square must not be a parallelogram” means 

MPQRS ( PQRS is a square =>■ PQRS is not a parallelogram). 

The second statement is not the negation of the first. The correct negation, in symbol, is 
BPQRS ( PQRS is a square A PQRS is a parallelogram). 

In words, it means “there exists a square that is not a parallelogram.” 

Section 3.1 

1. Placing six dominoes horizontally in each row covers the entire chessboard. 

3. Let f(x) = x 3 — 12x + 2. From the following chart 


X 

-4 

-3 

-2 

-1 

0 

1 

2 

3 

4 

f{x) 

-14 

12 

18 

13 

2 

-9 

-14 

-7 

18 


we conclude there x 3 — 12a; + 2 = 0 has a solution between —4 and —3, another one between 
0 and 1, and a third one between 3 and 4. So it has at least three real solutions. 

Remark. The Fundamental Theorem of Algebra asserts that a real polynomial of degree n 
has at most n real roots. Hence, the given equation has exactly three real solutions. 

7. n = 3. 

Section 3.2 

1. No, 2 3 + 1 = 9 is composite. 

7. According to (i), the number y/2 is irrational. It follows from (ii) that \[2 = \f\/2 is also 
irrational. Applying (ii) one more time, we conclude that ypl = \J \[2 is irrational. 

8. (a) The statement is false, because (— 3) 2 > (— 2) 2 , but —3 ^ —2. 

(b) The statement is false, because when n = 41, 

n 2 + n + 41 = 41 2 + 41 + 41 = 41(41 + 1 + 1) = 41 • 43 


is composite. 




Appendix B Answers to Selected Exercises 


283 


Section 3.3 

1. (a) We will prove the contrapositive of the given statement. That is, we will prove that if 
n is odd, then n 2 is odd. If n is odd, we can write n = 2q + 1 for some integer q. Then 

n 2 = (2 q + if = 4q 2 +4q + l = 2(2 q 2 + 2 q) + 1, 

where 2 q 2 + 2 q is an integer. This shows that n 2 is odd. 

(b) Suppose the given statement is false. That is, suppose n 2 is even, but n is odd. Since 
n is odd, n = 2q + 1 for some integer q. Then 

n 2 = (2 q + If =4q 2 +4q + l = 2(2 q 2 + 2 q) + 1, 

where 2 q 2 + 2 q is an integer. This shows that n 2 is odd, which contradicts the assumption 
that n 2 is even. Therefore, the given statement must be true. 

9. Suppose there exist some numbers a fb such that a 2 + b 2 = 2 ab. Then 

0 = a 2 — 2 ab + b 2 = (a — bf 

would have implied that a = b. This contradicts the assumption that a f b. Therefore, 
a 2 + b 2 f 2 ab. 

15. Suppose (p q) V {p => q) is false for some logical statements p and q. For a disjunction 
to be false, we need 

• p =>■ q to be false, and 

• p => q to be false. 

They in turn require 

• p to be true and q to be false, and 

• p to be true and q to be false. 

Having q false would imply q is true, which contradicts what we found. Therefore, the 
given logical formula is always true, hence, a tautology. 


Section 3.4 


1. We proceed by induction on n. When n = 1, the left-hand side of the identity reduces to 
1 J = 1, and the right-hand side becomes = 1. Hence, the identity holds when n = 1. 
Assume the identity holds when n = k for some integer k > 1; that is, assume 

o o o o k 2 (k + lf 

l 3 + 2 3 + 3 3 + • • • + A; 3 = — ^ - 

4 


for some integer k > 1. We want to show that it also holds when n = k + 1; that is, we 
want to show that 


l 3 + 2 3 + 3 3 + • • • + (fc + l) 3 


(k + lf(k + 2f 
4 


Using the inductive hypothesis, we find 
l 3 + 2 3 + 3 3 + • • • + (k + l) 3 


l 3 + 2 3 + 3 3 H \-k 3 + (k + l) 3 

fc 2 (fc + l) 2 3 

4 +(fc + l ) 3 

(k + l) 2 [fc 2 + 4(fc + 1)] 

4 

(k + lf(k 2 + 4/c + 4) 


(k + lf(k + 2f 


4 
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Therefore, the identity also holds when n = k + 1. This completes the induction. 


Section 3.5 

1. We proceed by induction on n. When n = 1, the product n(n+l)(n+2) becomes 1-2-3 = 6, 
which is obviously a multiple of 3. Hence, the claim holds when n = 1. Assume the claim 
holds when n = k for some integer k > 1; that is, assume that k(k + l)(fc + 2) is a multiple 
of 3 for some integer k > 1. Then we can write 

k(k + 1 )(k + 2) = 3q 

for some integer q. We want to show that the claim is still valid when n = k + 1. That is, 
we want to show that (k + l)(fc + 2 )(k + 3) is also a multiple of 3. So we want to find an 
integer Q such that 

(k + l)(fc + 2)(fc + 3) = 3 Q. 

We note that, using the inductive hypothesis, 

(k + l)(fc + 2)(fc + 3) = k(k + l)(k T 2) + 3(fc + l)(k + 2) 

= 3 g + 3(k + 1 )(fc + 2) 

= 3 [q + (k + l)(/c + 2)], 

where q + {k + 1 )(k + 2) is an integer. Hence, (k + 1 )(k + 2 )(k + 3) is a multiple of 3. This 
completes the induction. 

11. (b) S n = 1 — f° r a ll integers n > 1. 

12. (b) T n = f or a n integers n > 0. 


Section 3.6 


1. We proceed by induction on n. When n = 1, the left-hand side of the identity reduces 
to F-j 2 = l 2 = 1, and the right-hand side becomes Ft F 2 = 1-1 = 1. Hence, the identity 
holds when n = 1. Assume the identity holds when n = k for some integer k > 1; that is, 
assume 


Fl + Fi + Fi + • • • + Fi = F k F k+1 


for some integer k > 1. We want to show that it also holds when n = k + 1; that is, we 
want to show that 


+ F% + F 3 2 + • • • + F k+1 — F k+1 F k+2 . 


Using the inductive hypothesis, we find 


Fl + Fl + Fl + ■ 


fc +1 


Fl + Fl + Fl + • • • + Fl + Fl +1 
F k F k + i + Fl +1 
F k +i{F k + F k+ 1) 

F k +iF k +2- 


Therefore, the identity also holds when n = k + 1. This completes the induction. 


Section 4.1 

1. (a) {-5, -4, -3, -2, -1, 0, 1, 2, 3} (b) {1, 2, 3} (c) {0, -2, 3} (d) {-3, 3} 

3. (a) {n € Z | n < 0} 

(b) {n € Z | n is a perfect cube} 

(c) {n € Z | n is a perfect square} 
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5. (a) Z" (d) 5Z (f) 4 + 6Z 

Remark. We cannot write (b) as Z 3 and (c) as Z 2 , because Z 3 and Z 2 mean something 
else. If we drop 0 from (e), then {4, 8, 12, . . .} = 4N. However, the inclusion of 0 makes it 
harder to describe (d) in the form of AS. 

7. (a) (-4,7) (b) (-4,7] (c) (0,7] 

9. (a) 10 (b) 11 (c) 7 

11. (a) true (b) true (c) true (d) false 

13. (a) It is incorrect to write (3, 7] = 3 < x < 7 because (3, 7] is a set, but 3 < x < 7 is a 
logical statement. 

(b) No, because both {x G K. | x 2 < 0} and 0 are sets, so we should use an equal sign to 
compare them. The notation = only applies to logical statements. The correct way to say 
it is “{x G R | x 2 < 0} = 0.” 

Section 4.2 

1. (a) true (b) true (c) true (d) true (e) true (f) false 

3. We have Z C N because every integer n is also a rational number, as we can write it as 
the rational number j. 

5. Yes, this is the transitive property. 

7. (e) {0, {a},{{b}},{a, {b}}} 

11. (a) False, because the set {a} cannot be found in {a, b, c} as an element. 

(b) False, because a, the sole element in {a}, cannot be found in {{a}, b, c} as an element. 

(c) False. For {a} G p({{a}, b, c}), the set {a} must be a subset of {{a}, b, c}}. This means 
a must belong to {{a}, b, c}, which is not true. 

Section 4.3 

1. (a) {-4, -3, -2, -1,0, 1,2, 3, 4} 

(b) {-3, -2, -1,0, 1,2, 3, 4} 

(c) {-3, -2, -1,0, 1,2, 3,...} 

3. (a) false (b) false 

5. (a)£flB (b)EUB 

7. For example, take A = {x}, and B = {{x},x}. 

9. Assume ACC and B C C, we want to show that A\JB C C. In this regard, let x G AUJ3, 

we want to show that x G C as well. Since x G A U B, the definition of set union asserts 

that either x G A or x G B. 

• Case 1: If x G A, then ACC implies that x G C. 

• Case 2: If x G B, then B C C implies that x G C. 

In both cases, we find x G C. This proves that AuBCC. 
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13. (a) The notation n is used to connect two sets, but “x € A” and “x € B ” are both logical 
statements. We should also use <t=> instead of =. The statement should have been written 
as“ieiAieB»iedn B.” 

(b) If we read it aloud, it sounds perfect: 

If x belongs to A and B , then x belongs to AnB. 

The trouble is, every notation has its own meaning and specific usage. In this case, A 
is not exactly a replacement for the English word “and.” Instead, it is the notation for 
joining two logical statements to form a conjunction. Before A, we have “x G A,” which is 
a logical statement. But, after A, we have “B,” which is a set, and not a logical statement. 
It should be written as “x € A A x G B => x € A n B.” 

Section 4.4 

1. (a) {(-2,0), (-2, 4), (2,0), (2, 4)} 

(b) {(-2, -3), (-2, 0), (-2, 3), (-2, -3), (-2, 0), (-2, 3)} 

3. 2 • 2 • 2 • 3 = 24. 

5. (a) {(-2, 0), (-2, {-2}), (-2, {2}), (-2, {-2, 2}), (2, 0), (2, {-2}), (2, {2}), (2, {-2, 2})} 

Section 4.5 

1. fln =1 A„ = [0,2), lT=iA„ = (-l,oo). 

3. fr=o^ = 0> U” o C„=NU{0}. 

5' rinSN = Eq = {0} , UnSN = 2k 

”• Uie/ A-i = [1, °o), = 

9 - fU(i, 2 )(! - 2x,x 2 ) = [-1,1], Uo :6 (i, 2 )( 1 - 2a;,a: 2 ) = (-3 .4). 

11- a e( o,oc)^ = {(0,0)}, Ure(o,oo) A r = R* x M+ U {(0, 0)}. 

Section 5.1 

1. (a) 3 (b) 3 (c) 3 (d) 1 

3. We claim that the subset (3,5) does not have a smallest element. To see why, suppose it 
has a smallest element x. The midpoint between 3 and x is the number , an d 

„ 3 + x 

3 < — - — < x < 5. 

This means j s a l so inside the interval (3, 5), and is smaller than x. This contradicts the 
minimality of x. Thus, the interval (3,5) does not have a smallest element. Consequently, 
the interval (3, 5] is not well-ordered. 

5. We know that N is well-ordered. Since 2N is a subset of N, and 2N is clearly nonempty, 
we conclude from Problem 4 that 2N is also well-ordered. 
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Section 5.2 

1. (a) 23, 1 (b) -11, 1 (c) -6, 13 

3. This is an immediate consequence of Corollary 5.2.2. 

5. (a) Let n be any integer. Then n mod 3 = 0, 1, 2. 

• Case 1: if n mod 3 = 0, then n = 3q for some integer q , and 

n 3 — n = (3 q) 3 — 3 q = 27 q 3 —3 q = 3(9 q 2 — q), 
where 9 q 2 — q is an integer. 

• Case 2: if n mod 3 = 1, then n = 3q + 1 for some integer q , and 

n 3 — n = (3q + l) 3 - (3 q + 1) = 27 q 3 + 27 q 2 + 6 q = 3(9 q 3 + 9 q 2 + 2 q), 

where 9 q 3 + 9 q 2 + 2 q is an integer. 

• Case 2: if n mod 3 = 2, then n = 3q + 2 for some integer q, and 

n 3 -n = (3 q + 2) 3 - (3 q + 2) = 27 q 3 + 54 q 2 + 33 q + 6 = 3(9 q 3 + 18 q 2 + 11 q + 2), 
where 9<; 3 + 18q 2 + 11 q + 2 is an integer. 

In all three cases, we have shown that n 3 — n is a multiple of 3. 

(b) We note that 

n 3 — n = n(n 2 — 1) = n(n — l)(n + 1) = (n — 1 )n(n + 1) 

is a product of three consecutive integers. As we have seen in Problem 4, any three 
consecutive integers must contain a multiple of 3. It follows that their product is also a 
multiple of 3. 

7. (a) s + t (b) 4 

Section 5.3 

1. Assume a \ b and c | (—a). There exist integers x and y such that b = ax and —a = cy. 
Then 

b = ax = (— a)(— x) = cy ■ {—x) = (— c) • xy, 
where xy is an integer. Thus, (— c) | b. 

7. There are three cases, depending on the remainder when an integer is divided by 3. 

• (3g) 2 = 9<? 2 = 3 • 3 q 2 . 

• (3<7 + l) 2 = 9 q“ + 6q + 1 = 3(3<7“ + 2 q) + 1. 

• (3 q + 2) 2 = 9 q 2 + 12q + 4 = 9q 2 + 12q + 3 + 1 = 3(3 q 2 +4q+l) + l. 

In each case, we have shown that the square of an integer is of the form 3 k or 3fc + 1. 

Section 5.4 

1. (a) 1 • 27 + 0 • 81 = 27 (b) -3 • 24 + 1 • 84 = 12 (c) -35 • 1380 + 16 • 3020 = 20 

7. 1, 2, 17, and 34. 
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Section 5.5 

1. Since 

-3 • (2n + 1) + 2- (3n + 2) = 1, 
we deduce that gcd(2n + 1, 3 n + 2) = 1. 

5. Let a, 6, and c be positive integers such that a \ c, b \ c, and gcd (a,b) = 1. Then there 
exist integers x and y such that c = ax and c = by; and there exist integers s and t such 
that sa + tb = 1. It follows that 

c = c • 1 = c(sa + tb) = csa + ctb. 

Using c = ax and c = by, we find 

c = csa + ctb = by ■ sa + ax ■ tb = ab(ys + xt), 

where ys + xt is an integer. Thus, ab \ c. 

Section 5.6 

1. (a) 3 2 • 5 2 • 7 (b) 2 • 3 2 • 7 2 • 11 

2. (a) 81 (b) 168 

3. Every 50 days. 

5. Assume x £ 10Z fl 15Z, then x £ 10Z and x £ 15Z. This means x is a multiple of both 10 
and 15. Consequently, a; is a multiple of lcm(10, 15) = 30, which means x £ 30Z. Thus, 
ioz n 15Z c 30Z. 

Next, assume x £ 30Z, then a; is a multiple of 30. Consequently, a; is a multiple of 10, as 
well as a multiple of 15. This means x £ 10Z, and x £ 15Z. As a result, x £ 10Z n 15Z. 
Thus, 30Z C 10ZC15Z. Together with 10ZH15Z C 30Z, we conclude that 10ZH15Z = 30Z. 

7. (a) When p is divided by 4, its remainder is 0, 1, 2, or 3. But p is odd, hence, p is of the 
form 4/c + 1 or 4/c + 3 for some integer k. Since p > 3, we also need k to be a nonnegative 
integer. 

(b) When p is divided by 6, its remainder is 0, 1, 2, 3, 4, or 5. But p is odd, hence, p is of 
the form 6fc + 1, 6k + 3, or 6k + 5. We rule out the form 6A; + 3 because this would make 
p a multiple of 3. Hence, p is of the form 6k + 1 or 6k + 5 for some nonnegative integer k. 


Section 5.7 

1. The addition and multiplication tables for Zs are listed below. 


+ 

0 

1 

2 

3 

4 

5 

6 

7 

0 

0 

1 

2 

3 

4 

5 

6 

7 

1 

1 

2 

3 

4 

5 

6 

7 

0 

2 

2 

3 

4 

5 

6 

7 

0 

1 

3 

3 

4 

5 

6 

7 

0 

1 

2 

4 

4 

5 

6 

7 

0 

1 

2 

3 

5 

5 

6 

7 

0 

1 

2 

3 

4 

6 

6 

7 

0 

1 

2 

3 

4 

5 

7 

7 

0 

1 

2 

3 

4 

5 

6 



0 

1 

2 

3 

4 

5 

6 

7 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

5 

6 

7 

2 

0 

2 

4 

6 

0 

2 

4 

6 

3 

0 

3 

6 

1 

4 

7 

2 

5 

4 

0 

4 

0 

4 

0 

4 

0 

4 

5 

0 

5 

2 

7 

4 

1 

6 

3 

6 

0 

6 

4 

2 

0 

6 

4 

2 

7 

0 

7 

2 

5 

4 

3 

2 

1 


and 


Only 1, 3, 5, and 7 have multiplicative inverses. In fact,l 1 = 1, 3 1 = 3, 5 1 = 5, 
7~ 1 = 7. 
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3. The sum is 9, and the product is 7. 
5. From the following computation 


m (mod 7) 

ra 2 + 1 (mod 7) 

0 

0 2 + 1 = 1 

±1 

l 2 + 1 = 2 

±2 

2 2 + 1 = 5 

±3 

3 2 + 1 = 10 = 3 


we determine that m 2 + 1^0 (mod 7). Hence, m 2 + 1 is not a multiple of 7 for all integers 
in. 

7. Both methods give 4 45 = 1 in Zn. 

9. (a) 9 

Section 6.1 

1 . 


3. 

Section 6.2 

L [|,oo). 

3. Only g is a well-defined function. The image /( 4) is undefined, and there are two values 
for h( 3). Hence, both f and h are not well-defined functions. 

5. (a) Yes, because no division by zero will over occur. 


9. (a) 7 (b) 7 (c) 3 


X 

1 

2 

3 

4 

q(x) 

2 

3 

1 

3 


X 

1 

2 

3 

4 

p{x) 

3 

1 

2 

2 


X 

5.7 

7 r 

e 

-7.2 

-0.8 

9 

IaJ 

5 

3 

2 

-8 

-1 

9 


6 

4 

3 

-7 

0 

9 

M 

6 

3 

3 

-7 

-1 

9 


[0, oo). 


Section 6.3 


1. (a) No. For example, /( 0) = /( 2) = 1. 

(b) Yes, since g'( x) = 3x 2 — 4x = x(3x — 4) > 0 for x > 2. 


3. Because the domain and the codomain are half-open intervals, we need to be careful with 
the inclusion and exclusion of the endpoints. We can use the graph displayed below on 
the left. 
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We find f(x) = § x + 

5. (a) One-to-one (b) Not one-to-one 
7. (a) Not one-to-one (b) One-to-one 

9. There are twelve one-to-one functions from {1,2} to {a, b, c, d}. The images of 1 and 2 
under them are listed below. 



h 

h 

h 

h 

h 

h 

h 

fs 

h 

fio 

fn 

/12 

1 

a 

a 

a 

b 

b 

b 

C 

c 

C 

d 

d 

d 

2 

b 

c 

d 

a 

c 

d 

a 

b 

d 

a 

b 

c 


11. (a) One-to-one (b) Not one-to-one (c) Not one-to-one 

Section 6.4 

1. (a) Yes! It is not easy to express x in terms of y from the equation y = x 3 — 2x 2 + 1. 

However, from its graph, we can tell that the y-values cover all the possible real values in 

the codomain. 

(b) No, because g(x) > 1. 

5. (b) Not onto (c) Onto 

7. (b) Not onto (c) Onto 

9. No, because we have at most two distinct images, but the codomain has four elements. 
11. (a) Onto (b) Not onto (c) Not onto 

Section 6.5 

1. (a) fi(A) = {a, b}, ft\B) = {2, 3, 4, 5} 

(b )f 2 (A) = {a,c}, fc 1 (B) = { 2,4} 

(c) f 3 (A) = {b,d}, / 3 - 1 (B)=0 

(d) / 4 (A) = {e}, fr 1 (B) = { 5} 

3. The images of s are tabulated below. 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

s(x) 

7 

11 

3 

7 

11 

3 

7 

11 

3 

7 

11 

3 


(a) {3,11} (b) {0,3, 6, 9} (c) {3,7,11} 

5. (a) [20,26); {20,23,26} (b) [-3, -|); {-2} 

7. (a) {§,§,f,3,9,27, 15,45, 135} (b) {(-3,2)} (c) N x {0} 

9. For a function to be well-defined, each row sum must be 1. For the function to be one-to- 
one, each column sum must be at most 1. For the function to be onto, each column sum 
must be at least 1 (hence, no column sum is zero). 

13. Let y € /(Ci) — /(C 2 ), we want to show that y € /(Ci — C 2 ) as well. Since y G /(Ci) — 
f(C 2 ), we know there exists x € A such that f(x) = y. Having y € /(C/) — f(C — 2) means 
y € /(Ci) but y £ /(C 2 ). Hence, x € Ci but x ^ C 2 . In other words, x € Ci — C 2 . This 
leads to y = f(x) € f{C\— C 2 ). This completes the proof that /(Ci) — /(C 2 ) C f{C\— C 2 ). 

17. {0,1, 4, 9}; {0, ±1, ±2, ±3}. 
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Section 6.6 

1. Only (e) is bijective. 

3. Their inverse functions / _1 , </ _1 : (4, 7) — ► (1, 3) are defined by 



and g 1 (x) 



r ri m i _i/ n f x - 3 if 4 < x < 5, 

5 -g : [4, 7] ->[1,3], where g (x) = | i (n _ x) if 5 < * < 7 . 

7. s _1 : (—oo, —3) — > R, where s _1 (a:) = \ In (^=^). 

9. (a) at -1 : Q — » Q, u~ x (x) = (x + 2)/3 

11. The images under a” 1 : {a, b, c, d, e, /, g, /i} — ► {1, 2, 3, 4, 5, 6, 7, 8} are given below. 


X 

a 

6 

c 

d 

e 

/ 

3 

h 

a -1 (a;) 

2 

5 

8 

3 

6 

7 

1 

4 


Section 6.7 

1. Both / o g and g o f are from R to R, where (/ o g)(x) = 15a; 2 + 19, and (g o /)(a;) = 
75a; 2 - 30a; + 7. 

3. We do not need to find the formula of the composite function, as we can evaluate the 
result directly: f(g(f( 0))) = f(g( 1)) = /( 2) = -5. 

5. (a) gof:Z ->• Q, (g o /)(n) = l/(n 2 + 1) 

(b)s°/:R-> (0, 1), (go /)( x) = a: 2 /(a; 2 + 1) 

7. (a) go f: {1, 2, 3,4, 5} — ► {1, 2,3,4, 5}, 

(5 0 /)(!) = 2, (g o /)( 2) = 5, (5 o /)(3) = 1, (5 o /)(4) = 3, (g o /)( 5) = 4 


9. go f:Z ->• Z, (g ° f)(n) 


3(2 n — 1) if n > 0, 
2n + 1 if n < 0. 


11. (a)/o fl :Z->Z, (f o g)(n) = 3 - n 

(f°g)~ l :Z -> Z, (fog)~ 1 (n) = 3-n 

/ _1 :Z-»Z, / _1 (n) = 2 — n 

g~ 1 -. Z — > Z, g _1 (n)=n — 1 

5” 1 o /-^Z -» Z, (g- 1 o = 3 - n 



12 3 6 

1 / 1 0 0 0 \ 

2 0 10 0 

3 0 0 1 0 

6 \ 0 0 0 1 / 
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2. (a) domain = image = {1, 2, 3, 6}. 

(b) domain = image = {1, 2, 3, 6}. 

(c) domain = {1,2, 3}, image = {2, 3, 6}. 




1 

2 

3 

6 

1 

f 0 

1 

1 

1 \ 

2 

1 

0 

1 

1 

3 

1 

1 

0 

1 

6 

l 1 

1 

1 

o / 



1 

2 

3 

6 

1 

f 0 

1 

1 

1 \ 

2 

0 

0 

1 

1 

3 

0 

0 

0 

1 

6 

^ 0 

0 

0 

0 ) 


1 

2 

4 

5 

10 

20 


1 2 4 5 10 20 

/ 0 1 1 1 1 1 \ 

0 0 10 1 1 

0 0 0 0 0 1 

0 0 0 0 1 1 

0 0 0 0 0 1 

\ 0 0 0 0 0 0 




0 

{ 1 } 

{ 2 } 

{ 1 , 

2 } 

0 

( 

0 

0 

0 

0 

\ 

{ 1 } 

0 

1 

0 

1 

{ 2 } 


0 

0 

1 

1 


{ 1 , 2 } 

\ 

0 

1 

1 

1 

/ 


Section 7.2 

1. (a) Reflexive, symmetric, antisymmetric, and transitive. 

(b) Irreflexive, and symmetric. 

(c) Irreflexive, and transitive. 


2. (a) Antisymmetric. 

(b) Reflexive, symmetric, and transitive. 

(c) Irreflexive, symmetric, and transitive. 


3. Reflexive, symmetric, and transitive. 

4. Antisymmetric, and transitive. 

5. Irreflexive, and antisymmetric. 


6. Symmetric. 
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7. (a) A is not reflexive because (A, X) ^ A if X ^ 0. 

(b) A is not irreflexive because (0,0) € A. 

(c) No. For example, consider S = {a,b,c}, X = {a}, Y = { b }, and Z — {a, c}. Then 
(X, Y) G A, (y, Z) G A, but (X, Z) £ A. 

( d ) 0 {a} {6} {c} {a, b} {a,c} {b, c} {a,b,c} 



0 M {&} {c} {a, 6} {a, c} {6, c} {a, 6,c} 








r-Cl 

o 


o 

1-0 















Si 









0 

( 

1 

1 

1 

1 

1 

1 

1 

1 

\ 

M 


1 

0 

1 

1 

0 

0 

1 

0 


{b} 


1 

1 

0 

1 

0 

1 

0 

0 


to 


1 

1 

1 

0 

1 

0 

0 

0 


{a,b} 


1 

0 

0 

1 

0 

0 

0 

0 


{a, c} 


1 

0 

1 

0 

0 

0 

0 

0 


{ b , c} 


1 

1 

0 

0 

0 

0 

0 

0 


{a,b,c} 

V 

1 

0 

0 

0 

0 

0 

0 

0 

) 

8. (a) Symmetric. 











(b) Reflexive, and symmetric. 








9. (a) Reflexive, antisymmetric, and transitive. 

(b) Reflexive, symmetric, and transitive. 

(c) Symmetric. 


10. (a) Reflexive, antisymmetric, and transitive. 

(b) Symmetric. 

(c) Symmetric, and transitive. 

11. (a) Reflexive, and transitive. 

(b) Symmetric, 

(c) Reflexive, symmetric, and transitive. 

12. (a) Symmetric, and transitive. 

(b) Reflexive, symmetric, and transitive. 

(c) Reflexive, and transitive. 

Section 7.3 

1. (a) The equivalence classes are of the form {3 — k, 3 + k} for some integer k. For instance, 
[3] = {3}, [2] = {2, 4}, [1] = {1, 5}, and [-5] = {-5, 11}. 

(b) There are three equivalence classes: [0] = 3Z, [1] = 1 + 3Z, and [2] = 2 + 3Z. 

3. (a) True 

(b) False 

(c) [{1,5}] = {{1},{1, 2}, {1,4}, {1,5}, {1,2, 4}, {1,2, 5}, {1,4, 5}, {1,2, 4, 5}} 
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(d) [A] = {(A fl T) U y I y G p(T)}. In other words, S ~ A if S contains the same 
element in A (~l T, plus possibly some elements not in T. 

5. (a) Yes, with [(a, b)] = {(#, y) \ y = x + k for some constant k}. In other words, the 
equivalence classes are the straight lines of the form y = x + k for some constant k. 

(b) No. For example, (2,5) ~ (3,5) and (3,5) ~ (3,7), but (2,5) / (3,7). Hence, the 
relation ~ is not transitive. 

7. We find [0] = \ Z = {f | n G Z}, and [j] = \ + \ Z = | n G Z}. 

Section 7.4 

1. The Hasse diagram is shown below. 





1 


3. Let a G B, since B C A, we also find a £ A. Since (A, X) is a poset, the relation A on 
A is reflexive, hence, a A a. This shows that A is still reflexive when restricted to B. 
Antisymmetry and transitivity are proved with a similar argument. 

5. (b) The Hasse diagram is shown below. 


-2 2 



0 


7- B = {0,{a},{a,5},{a,6,c},{a,6,c,d}}. 


Section 8.2 

1 . 6 . 

3. 70. 

5. 7-5 + 7-4 + 5-4 
7. 4 5 , 4 5 — 3 • 4 2 
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9. (a) 52 4 (b) 39 4 (c) 4 • 13 4 (d) 4 • 48 • 52 3 (e) 52 4 - 48 4 

11. (a) 9 • 10 3 (b) 8 • 9 3 (c) 9 • 10 3 - 8 • 9 3 (d) 9 • 10 

13. (a) 8 6 (b) 8 • 7 • 6 • 5 • 4 • 3 (c) 0 (d) 8 6 - 4 6 (e) 4 • 8 4 (f) 7 5 

Section 8.3 

1. 62 8 , P( 62,8). 

3. P(14, 5). 

5. p{ 7, 3) • P(10, 3) + P(7, 3) • P(ll, 3) + P(10, 3) • P(ll, 3). 

7. P(ll, 7) • 3!/7. 

Section 8.4 

i- (S)(D- 

3. (a) at least 5 (b) at least 7 

5. 10. 

7. (a) (?) (b) (?) - (?) (c) (?) (I) (?) + (?) © (?) + (?) (?) (?) 

9. (a) 8! (b) (?) P(8, 2) [(?) P( 8, 2)+2-7-6-7 + 7-6] 

11 . (?). 

13. (a) ( 52 ) (b) 4 (?) 13 3 (c) 13 (?) (?) 4 3 (d) 13 (?) (?) 4 2 

(e) 13 (?) 12 (?) (f) 10 • (4 5 - 1) (g) 4 [(?) - 10] (h) 4 • 10 

Section 8.5 

1. (a) x 5 + 5x 4 y + 10 x 3 y 2 + 10x 2 y 3 + 5 xy 4 + y 5 

(b) s 6 — 6s 5 t + 15s 4 < 2 — 20s 3 t 3 + 15s 2 t 4 — 6 st 5 + t 6 

(c) a 4 + 12 o 3 5 + 54a 2 6 2 + 108a6 3 + 816 4 

3. (a) (?) = 6 (b) -(?) 3« (l) 3 = (e) 0 (d) -(«) 3 3 (f) 3 = 

5- ELo ©r fc = (l + r)" 

7. (c) P = 2(?) + (?) (d) £Li P = \n{n+ 1)(2 n + 1) 



Index 


r-combination, 235 
r-permutation, 230 

additive identity, 152 
additive inverse, 152 
affirmation of the consequence, 50 
algebraic structure, 152 
anchor step, 58 
antecedent, 17 

antisymmetric property, 201, 202 
arc, 199 

arrow diagram, 160 
assertion, 9 

associative property, 31, 100, 152 
basis step, 58 

biconditional statement, 24 
bijection, 184 
bijective function, 184 
binary operator, 14 
binary string, 221 
binomial coefficient, 244 
binomial theorem, 243 

canonical factorization, 142 
cardinality, 86 
Cartesian product, 105 
n-fold, 106 
ceiling function, 158 
chain, 218 

circular permutation, 233 

clock arithmetic, 146 

closed interval, 83 

codomain, 157, 159 

coefficient, 76 

collection of sets, 86 

combination, 235 

common divisor, 129 

common factor, 129 

commutative property, 31, 100, 152 

complement, 98 

complete relation, 202 

component, 209 

composite function, 189 

composite number, 126 

compound statement, 14 


conclusion, 17 
conditional statement, 18 
congruent, 146 
conjunction, 14 
consequence, 17 
constant function, 165 
constructive proof, 44 
contingency, 28 
contradiction, 28 
contrapositive, 20 
converse, 20 
corollary, 51 

decreasing function, 166 
denial of the antecedent, 50 
De Morgan’s laws, 31, 100 
extended, 114 
generalized, 102 
digraph, 199 
direct proof, 47 
directed edge, 199 
directed graph, 199 
directed line, 199 
disjoint sets, 222 
disjoint union, 222 
disjunction, 14 

distributive laws, 31, 100, 152 
dividend, 120 
divisibility, 125 
division algorithm, 120 
divisor, 120, 125 
domain, 157, 159 
domain of a relation, 199 
domination laws, 31 
dot, 199 

element, 81 
empty relation, 202 
empty set, 85 
equality of sets, 85 
equivalence class, 209 
equivalence relation, 208 
Euclid’s lemma, 138 
Euclidean algorithm, 130 
extended, 133 
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even integer, 48, 126 
exclusive or, 15, 17 
existential quantification, 38 
existential quantifier, 38 

factor, 125 

fallacy of the converse, 50 
fallacy of the inverse, 50 
family of sets, 86 
Fibonacci numbers, 73 
held, 152 
finite set, 86 
floor function, 158 
function, 159 

fundamental theorem of arithmetic, 141 
fundamental theorem on equivalence relations, 
211 

Goldbach Conjecture, 11 
graph, 162 

greatest common divisor, 129 

half-open interval, 83 
Hasse diagram, 217 
hypothesis, 17 

idempotent laws, 31, 100 
identity function, 165 
identity laws, 31 
identity relation, 202 
image, 159 

image of a function, 157, 177 
image of a relation, 199 
image of a set, 176 
implication, 17 
incidence matrix, 163, 199 
inclusive or, 15 
increasing function, 166 
index of summation, 61 
index set, 112 
indexed family, 112 
indirect proof, 52 
induction hypothesis, 59 
inductive hypothesis, 59 
inductive step, 58 
initial step, 58 
injection, 165 
injective function, 165 
intersection, 97 
interval notation, 83 
inverse, 20 
inverse function, 184 
inverse laws, 31, 100 
irreflexive property, 201, 202 


law of detachment, 47 

law of syllogism, 47 

laws of the excluded middle, 31, 100 

least common multiple, 143 

linear combination, 127 

linear combination, 76, 132 

linear ordering, 218 

logical operator, 14 

logical connectives, 14 

logical equivalence, 30 

many-to-one function, 165 
map, 160 
mapping, 160 

mathematical induction, 58 
strong form, 74 
weak form, 74 
matrix representation, 199 
member, 81 

modular arithmetic, 146 
modulus, 146 
modus ponens, 47 
money changing problem, 77 
multiple, 48, 125 
multiplication principle, 108 
multiplicative identity, 152 
multiplicative inverse, 152 
multiset, 88 

nearest integer function, 157 
necessary condition, 22 
necessity, 93 
negation, 11 
negative, 152 
non-trivial divisor, 126 

odd integer, 48, 126 
one-to-one function, 165 
onto function, 158, 171 
open interval, 83 
operand, 14 
ordered pair, 104 
ordered n-tuples, 106 

pairwise disjoint sets, 123, 222 
part, 209 

partial ordering, 216 
partial-order relation, 216 
partially ordered set, 216 
partition of a set, 123, 209 
Pascal triangle, 248 
Pascal’s identity, 248 
permutation, 230 
poset, 216 

postage stamp problem, 77 
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power set, 94 
precedence, 25 
predecessor, 217 
predicate, 36 
preimage of a set, 179 
premise, 17 
prime, 126 

prime-power factorization, 142 

principle of inclusion-exclusion, 223 

principle of mathematical induction, 58 

principle of well-ordering, 117 

priority, 25 

proof by cases, 49 

proof by contradiction, 53 

proof by contrapositive, 52 

proper divisor, 126 

proper subset, 93 

proposition, 9 

propositional function, 36 

propositional variable, 11 

quotient, 120 

range, 158, 177 
range of a relation, 199 
reciprocal, 152 
recurrence relation, 73 
reflexive property, 201, 202 
relation between two sets, 197 
relation on a set, 201 
relative complement, 98 
relative prime, 132 
remainder, 120 
repeated squaring, 150 
residue classes modulo n, 212 
residues, 147 
roster method, 81 

set, 81 

set difference, 98 

set of integers modulo n, 151 

set-builder notation, 81 

short circuit evaluation, 14 

sigma notation, 61 

statement, 9 

subposet, 219 

successor, 217 

sufficiency, 93 

sufficient condition, 22 

summation notation, 61 

superset, 90 

surjection, 171 

surjective function, 171 

symmetric property, 201, 202 


tautology, 28 

total ordering, 217 

transformation, 160 

transitive property, 92, 127, 201, 202 

trivial divisor, 126 

truth table, 11 

twin primes, 38 

unary operator, 14 
union, 97 

universal quantification, 37 
universal quantifier, 37 
universal set, 89 

Venn diagram, 90 
vertex, 199 

well-ordered set, 118 

Young tableau, 221 



